COLON SPECIFIC GENE AND PROTEIN

This invention relates to newly identified 
polynucleotides, polypeptides encoded by such 
polynucleotides, and the use of such polynucleotides and 
polypeptides. The present invention further relates to 
inhibiting the production and function of the polypeptides of 
the present invention. 

The gastrointestinal tract is the most common site of 
both newly diagnosed cancers and fatal cancers occurring each 
year in the USA; figures are somewhat higher for men than for 
women. The incidence of colon cancer in the USA is 
increasing, while that of gastric cancer is decreasing; 
cancer of the small intestine is rare. The incidence of 
gastrointestinal cancers varies geographically. Gastric 
cancer is common in Japan and uncommon in the United States, 
whereas colon cancer is uncommon in Japan and common in the 
USA. An environmental etiologic factor is strongly suggested 
by the statistical data showing that people who move to a 
high-risk area assume the high risk. Some of the suggested 
etiologic factors for gastric cancer include aflatoxin, a 
carcinogen formed by Aspergillus flavus and present in 
contaminated food, smoked fish, alcohol, and vitamin A and 
magnesium deficiencies. A diet high in fat and low in bulk, 
and, possibly, degradation products of sterol metabolism may 
be the etiologic factors for colon cancer. Certain disorders 
may predispose to cancer, for example, pernicious anemia to 
gastric cancer, untreated nontropical sprue and immune 
defects to lymphoma and carcinoma, and ulcerative and 
granulomatous colitis, isolated polyps, and inherited 
familial polyposis to carcinoma of the colon. 

The most common tumor of the colon is the adenomatous polyp. 
Primary lymphoma is rare in the colon and most common in the 
small intestine. 

Adenomatous polyps are the most common benign 
gastrointestinal tumors. They occur throughout the GI tract, 
most commonly in the colon and stomach, and are found more 
frequently in males than in females. They may be single or, 
more commonly, multiple, and sessile or pedunculated. They 
may be inherited, as in familial polyposis and Gardner's 
syndrome, which primarily involves the colon. Development of 
colon cancer is common in familial polyposis. Polyps often 
cause bleeding, which may be occult or gross, but rarely cause 
pain unless complications ensue. Papillary adenoma, a less 
common form found only in the colon, may also cause 
electrolyte loss and mucoid discharge. 

A malignant tumor includes a carcinoma of the colon, 
which may be infiltrating or exophytic and occurs most 
commonly in the rectosigmoid. Because the content of the 
ascending colon is liquid, a carcinoma in this area usually 
does not cause obstruction, but the patient tends to present 
late in the course of the disease with anemia, abdominal 
pain, or a palpable abdominal mass. 

The prognosis with colonic tumors depends on the degree 
of bowel wall invasion and on the presence of regional lymph 
node involvement and distant metastases. The prognosis with 
carcinoma of the rectum and descending colon is quite 
unexpectedly good. Cure rates of 80 to 90% are possible with 
early resection before nodal invasion develops. For this 
reason, great care must be taken to exclude this disease when 
unexplained anemia, occult gastrointestinal bleeding, or a 
change in bowel habits develops in a previously healthy 
patient. Complete removal of the lesion before it spreads to 
the lymph nodes provides the best chance of survival for a 
patient with cancer of the colon. Detection in an 
asymptomatic patient by occult-blood screening 
results in the highest five-year survival. 

Clinically suspected malignant lesions can usually be 
detected radiologically. Polyps less than 1 cm can easily be 
missed, especially in the upper sigmoid and in the presence 
of diverticulosis. Clinically suspected and radiologically 
detected lesions in the esophagus, stomach or colon can be 
confirmed by fiber optic endoscopy combined with histologic 
tissue diagnosis made by directed biopsy and brush cytology. 
Colonoscopy is another method utilized to detect colon 
diseases. Benign and malignant polyps not visualized by 
X-ray are often detected on colonoscopy. In addition, patients 
with one lesion on X-ray often have additional lesions 
detected on colonoscopy. Sigmoidoscopic examination, however, 
detects only about 50% of colonic tumors. 

The above methods of detecting colon cancer have 
drawbacks; for example, small colonic tumors may be missed by 
all of the above-described methods. Detecting colon cancer 
is also extremely important in order to prevent 
metastases. 

In accordance with an aspect of the present invention, 
there are provided nucleic acid probes comprising nucleic 
acid molecules of sufficient length to specifically hybridize 
to the RNA transcribed from the human colon specific gene of 
the present invention or to DNA corresponding to such RNA. 

In accordance with another aspect of the present 
invention there is provided a method of and products for 
diagnosing colon cancer metastases by determining the 
presence of RNA transcribed from the human colon specific 
gene of the present invention or DNA corresponding to such 
RNA in a sample derived from a host. 

In accordance with yet another aspect of the present 
invention, there is provided a method of and products for 
diagnosing colon cancer metastases by detecting an altered 
level of a polypeptide corresponding to the colon specific 
gene of the present invention in a sample derived from a 
host, whereby an elevated level of the polypeptide indicates 
a colon cancer diagnosis. 

In accordance with another aspect of the present 
invention, there are provided isolated polynucleotides 
encoding a polypeptide of the present invention, including 
mRNAs, DNAs, cDNAs, genomic DNAs, as well as antisense 
analogs and biologically active and diagnostically or 
therapeutically useful fragments thereof. 

In accordance with a further aspect of the present 
invention, there are provided novel polypeptides encoded by 
the polynucleotides, as well as biologically active and 
diagnostically or therapeutically useful fragments, analogs 
and derivatives thereof. 

In accordance with yet a further aspect of the present 
invention, there is provided a process for producing such 
polypeptides by recombinant techniques comprising culturing 
recombinant prokaryotic and/or eukaryotic host cells, 
containing a polynucleotide of the present invention, under 
conditions promoting expression of said proteins and 
subsequent recovery of said proteins. 

In accordance with yet a further aspect of the present 
invention, there are provided antibodies specific to such 
polypeptides. 

In accordance with another aspect of the present 
invention, there are provided processes for using the 
polypeptides of the present invention to treat colon cancer 
and for using the polypeptide to screen for compounds which 
interact with the polypeptides, for example, compounds which 
inhibit or activate the receptor for the polypeptide of the 
present invention. 

In accordance with yet another aspect of the present 
invention, there is provided a process to screen for 
compounds which interact with the polypeptides, for example, 
compounds which inhibit or activate the polypeptides of the 
present invention. 

In accordance with yet a further aspect of the present 
invention, there are provided processes for utilizing such 
polypeptides, or polynucleotides encoding such polypeptides, 
for in vitro purposes related to scientific research, 
synthesis of DNA and manufacture of DNA vectors. 

These and other aspects of the present invention should 
be apparent to those skilled in the art from the teachings 
herein. 

The following drawings are illustrative of embodiments 
of the invention and are not meant to limit the scope of the 
invention as encompassed by the claims. 

Figure 1 shows the cDNA sequence and the corresponding 
deduced amino acid sequence for the human colon specific gene 
disclosed in this application. The standard one-letter 
abbreviations for amino acids are used. 

The term "colon specific gene" means that such gene is 
primarily expressed in tissues derived from the colon, and 
such gene may be expressed in cells derived from tissues 
other than from the colon. However, the expression of such 
gene is significantly higher in tissues derived from the 
colon than from non-colon tissues. 

In accordance with an aspect of the present invention, 
there are provided isolated nucleic acids (polynucleotides) 
which encode for the mature polypeptide having the deduced 
amino acid sequence of Figure 1 (SEQ ID NO: 2) or for the 
mature polypeptide encoded by the cDNA of the clone(s) 
deposited as ATCC Deposit No. 97129 on April 28, 1995. 

A polynucleotide encoding the colon specific gene of the 
present invention was isolated from a human colon cancer cDNA 
library. The polynucleotide contains an open reading frame 
encoding a protein of 158 amino acid residues. The 
polypeptide exhibits structural homology to a galactose 
specific lectin from a diamondback rattlesnake with 36% 
identity and 54% similarity over a 125 amino acid stretch and 
30% identity and 52% similarity to a human pancreatic stone 
protein precursor. 

The polynucleotides of the present invention may be in 
the form of RNA or in the form of DNA, which DNA includes 
cDNA, genomic DNA, and synthetic DNA. The DNA may be 
double-stranded or single-stranded, and if single-stranded may be 
the coding strand or non-coding (anti-sense) strand. The 
coding sequence which encodes the mature polypeptides may be 
identical to the coding sequence shown in Figure 1 (SEQ ID 
NO:1) or that of the deposited clone(s) or may be a different 
coding sequence which, as a result of the 
redundancy or degeneracy of the genetic code, encodes the 
same mature polypeptides as the DNA of Figure 1 (SEQ ID NO:1) 
or the deposited cDNA. 
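
The degeneracy described above can be illustrated with a minimal Python sketch in which two different DNA coding sequences translate to the same peptide. The two short sequences and the function name `translate` are hypothetical and chosen only for illustration; they are not taken from Figure 1 or from the deposited clone. The codon table is the standard genetic code.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    peptide = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        peptide.append(aa)
    return "".join(peptide)


# Hypothetical example sequences (not from SEQ ID NO:1).
seq_a = "ATGGCTCTGAAA"
seq_b = "ATGGCCTTAAAG"
print(seq_a != seq_b, translate(seq_a), translate(seq_b))
```

The two sequences differ at three nucleotide positions yet encode the same four-residue peptide, which is the sense in which a "different coding sequence" can encode the same mature polypeptide.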

The polynucleotides which encode for the mature 
polypeptide of Figure 1 (SEQ ID NO: 2) or for the mature 
polypeptide encoded by the deposited cDNA may include: only 
the coding sequence for the mature polypeptide; the coding 
sequence for the mature polypeptide and additional coding 
sequence such as a leader or secretory sequence or a 
proprotein sequence; the coding sequence for the mature 
polypeptide (and optionally additional coding sequence) and 
non-coding sequence, such as introns or non-coding sequence 
5' and/or 3' of the coding sequence for the mature 
polypeptides. 

Thus, the term "polynucleotide encoding a polypeptide" 
encompasses a polynucleotide which includes only coding 
sequence for the polypeptide as well as a polynucleotide 
which includes additional coding and/or non-coding sequence. 

The present invention further relates to variants of the 
hereinabove described polynucleotides which encode for 
fragments, analogs and derivatives of the polypeptide having 
the deduced amino acid sequence of Figure 1 (SEQ ID NO: 2) or 
the polypeptide encoded by the cDNA of the deposited 
clone(s). The variant of the polynucleotides may be a 
naturally occurring allelic variant of the polynucleotides or 
a non-naturally occurring variant of the polynucleotides. 

Thus, the present invention includes polynucleotides 
encoding the same mature polypeptide as shown in Figure 1 
(SEQ ID NO: 2) or the same mature polypeptide encoded by the 
cDNA of the deposited clone(s) as well as variants of such 
polynucleotides which variants encode for a fragment, 
derivative or analog of the polypeptide of Figure 1 (SEQ ID 
NO: 2) or the polypeptide encoded by the cDNA of the deposited 
clone(s). Such nucleotide variants include deletion 
variants, substitution variants and addition or insertion 
variants. 

As hereinabove indicated, the polynucleotides may have 
a coding sequence which is a naturally occurring allelic 
variant of the coding sequence shown in Figure 1 (SEQ ID 
NO:1) or of the coding sequence of the deposited clone(s). 
As known in the art, an allelic variant is an alternate form 
of a polynucleotide sequence which may have a substitution, 
deletion or addition of one or more nucleotides, which does 
not substantially alter the function of the encoded 
polypeptide. 

The present invention also includes polynucleotides, 
wherein the coding sequence for the mature polypeptides may 
be fused in the same reading frame to a polynucleotide 
sequence which aids in expression and secretion of a 
polypeptide from a host cell, for example, a leader sequence 
which functions as a secretory sequence for controlling 
transport of a polypeptide from the cell. The polypeptide 
having a leader sequence is a preprotein and may have the 
leader sequence cleaved by the host cell to form the mature 
form of the polypeptide. The polynucleotides may also encode 
for a proprotein which is the mature protein plus additional 
5' amino acid residues. A mature protein having a 
prosequence is a proprotein and is an inactive form of the 
protein. Once the prosequence is cleaved an active mature 
protein remains. Thus, for example, the polynucleotides of 
the present invention may encode for a mature protein, or for 
a protein having a prosequence or for a protein having both 
a prosequence and a presequence (leader sequence). 

The polynucleotides of the present invention may also 
have the coding sequence fused in frame to a marker sequence 
which allows for purification of the polypeptide of the 
present invention. The marker sequence may be a 
hexa-histidine tag supplied by a pQE-9 vector to provide for 
purification of the mature polypeptide fused to the marker in 
the case of a bacterial host, or, for example, the marker 
sequence may be a hemagglutinin (HA) tag when a mammalian 
host, e.g. COS-7 cells, is used. The HA tag corresponds to 
an epitope derived from the influenza hemagglutinin protein 
(Wilson, I., et al., Cell, 37:767 (1984)). 

The term "gene" means the segment of DNA involved in 
producing a polypeptide chain; it includes regions preceding 
and following the coding region (leader and trailer) as well 
as intervening sequences (introns) between individual coding 
segments (exons). 

Fragments of the full length colon specific gene may be 
used as a hybridization probe for a cDNA library to isolate 
the full length gene and to isolate other genes which have a 
high sequence similarity to the gene or similar biological 
activity. Probes of this type preferably have at least 30 
bases and may contain, for example, 50 or more bases. The 
probe may also be used to identify a cDNA clone corresponding 
to a full length transcript and a genomic clone or clones 
that contain the complete colon specific gene including 
regulatory and promoter regions, exons, and introns. An 
example of a screen comprises isolating the coding region of 
the gene by using the known DNA sequence to synthesize an 
oligonucleotide probe. Labeled oligonucleotides having a 
sequence complementary to that of the gene of the present 
invention are used to screen a library of human cDNA, genomic 
DNA or mRNA to determine which members of the library the 
probe hybridizes to. 
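
As a rough illustration of deriving a probe with a sequence complementary to a known coding sequence, the following Python sketch computes a reverse-complement oligonucleotide of a chosen length. The function names, the 36-base example sequence, and the exact-match test are assumptions made for illustration; the check models only perfect complementarity, not hybridization under real laboratory conditions.

```python
# Map each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def design_probe(coding_strand: str, start: int, length: int = 30) -> str:
    """Return a probe complementary to coding_strand[start:start + length]."""
    target = coding_strand[start:start + length]
    if len(target) < length:
        raise ValueError("target region shorter than requested probe length")
    return reverse_complement(target)


def hybridizes_exactly(probe: str, strand: str) -> bool:
    """True if the probe is perfectly complementary to a region of strand."""
    return reverse_complement(probe) in strand.upper()


# Hypothetical 36-base coding sequence (not from SEQ ID NO:1).
example = "ATGGCTCTGAAAGCTGGTCCTAAGCTTGATCGTACC"
probe = design_probe(example, start=0, length=30)
print(probe, hybridizes_exactly(probe, example))
```

The default length of 30 follows the preference stated above for probes of at least 30 bases.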

The present invention further relates to 
polynucleotides which hybridize to the hereinabove-described 
sequences if there is at least 70%, preferably at least 90%, 
and more preferably at least 95% identity between the 
sequences. The present invention particularly relates to 
polynucleotides which hybridize under stringent conditions to 
the hereinabove-described polynucleotides. As herein used, 
the term "stringent conditions" means hybridization will 
occur only if there is at least 95% and preferably at least 
97% identity between the sequences. The polynucleotides 
which hybridize to the hereinabove described polynucleotides 
in a preferred embodiment encode polypeptides which 
retain substantially the same biological function or activity 
as the mature polypeptide encoded by the cDNAs of Figure 1 
(SEQ ID NO:1) or the deposited cDNA(s). 
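
The identity thresholds recited above (70%, 90%, 95% and 97%) can be made concrete with a small Python sketch that computes percent identity between two sequences and reports the highest threshold met. This is a simplified illustration only: it assumes the two sequences are already aligned and of equal length, it does not handle gaps, and the function names and example 20-mers are hypothetical.

```python
# Thresholds taken from the text, highest first.
TIERS = [
    (97.0, ">=97% (preferred stringent)"),
    (95.0, ">=95% (stringent)"),
    (90.0, ">=90%"),
    (70.0, ">=70%"),
]


def percent_identity(a: str, b: str) -> float:
    """Percent of positions that match between two pre-aligned sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def identity_tier(identity: float) -> str:
    """Return the label of the highest threshold that the identity meets."""
    for threshold, label in TIERS:
        if identity >= threshold:
            return label
    return "below 70%"


# Hypothetical 20-mers differing at a single position.
ref = "ACGTACGTACGTACGTACGT"
alt = "ACGTACGTACGTACGTACGA"
identity = percent_identity(ref, alt)
print(identity, identity_tier(identity))
```

One mismatch in twenty positions gives 95% identity, which just meets the definition of "stringent conditions" used herein.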

Alternatively, the polynucleotide may have at least 20 
bases, preferably 30 bases, and more preferably at least 50 
bases which hybridize to a polynucleotide of the present 
invention and which has an identity thereto, as hereinabove 
described, and which may or may not retain activity. For 
example, such polynucleotides may be employed as probes for 
the polynucleotide of Figure 1 (SEQ ID NO:1), for example, 
for recovery of the polynucleotide or as a diagnostic probe 
or as a PCR primer. 

Thus, the present invention is directed to 
polynucleotides having at least a 70% identity, preferably at 
least 90% and more preferably at least a 95% identity to a 
polynucleotide which encodes the polypeptide of Figure 1 (SEQ 
ID NO: 2) as well as fragments thereof, which fragments have 
at least 30 bases and preferably at least 50 bases and to 
polypeptides encoded by such polynucleotides. 

The deposit (s) referred to herein will be maintained 
under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Micro-organisms for purposes of 
Patent Procedure. These deposits are provided merely as a 
convenience to those of skill in the art and are not an 
admission that a deposit is required under 35 U.S.C. §112. 
The sequence of the polynucleotides contained in the 
deposited materials, as well as the amino acid sequence of 
the polypeptides encoded thereby, are incorporated herein by 
reference and are controlling in the event of any conflict 
with any description of sequences herein. A license may be 
required to make, use or sell the deposited materials, and 
no such license is hereby granted. 

In accordance with another aspect of the present 
invention there are provided polynucleotides which are 
preferably at least 10 base pairs in length and which 
hybridize to and are at least 70% identical to RNA (or 
corresponding DNA) transcribed from a human gene which has a 
coding sequence which is at least 90% identical to the coding 
sequence of the DNA sequence of Figure 1 (SEQ ID NO:1). 

Thus, the polynucleotide sequences which hybridize as 
described above may be used to hybridize to and detect the 
expression of the human gene to which they correspond for use 
in diagnostic assays as hereinafter described. 

In accordance with still another aspect of the present 
invention there are provided diagnostic assays for detecting 
micrometastases of colon cancer in a host. While applicant 
does not wish to limit the reasoning of the present invention 
to any specific scientific theory, it is believed that the 
presence of active transcription of a colon specific gene of 
the present invention in cells of the host, other than those 
derived from the colon, is indicative of colon cancer 
metastases. This is true because, while the colon specific 
genes are found in all cells of the body, their transcription 
to mRNA, cDNA and expression products is primarily limited to 
the colon in non-diseased individuals. However, if colon 
cancer is present, colon cancer cells migrate from the cancer 
to other cells, such that these other cells are now actively 
transcribing and expressing a colon specific gene at a 
greater level than is normally found in non-diseased 
individuals, i.e., transcription is higher than found in 
non-colon tissues in healthy individuals. It is the detection of 
this enhanced transcription or enhanced protein expression in 
cells, other than those derived from the colon, which is 
indicative of metastases of colon cancer. 

In one example of such a diagnostic assay, an RNA 
sequence in a sample derived from a tissue other than the 
colon is detected by hybridization to a probe. The sample 
contains a nucleic acid or a mixture of nucleic acids, at 
least one of which is suspected of containing RNA (or 
corresponding cDNA) transcribed from a human colon specific 
gene of the present invention. Thus, for example, in a form 
of an assay for determining the presence of a specific RNA in 
cells, initially RNA is isolated from the cells. 

A sample may be obtained from cells derived from tissue 
other than from the colon including but not limited to blood, 
urine, saliva, tissue biopsy and autopsy material. The use 
of such methods for detecting enhanced transcription to mRNA 
from a human colon specific gene of the present invention or 
fragment thereof in a sample obtained from cells derived from 
other than the colon is well within the scope of those 
skilled in the art from the teachings herein. 

The isolation of mRNA comprises isolating total cellular 
RNA by disrupting a cell and performing differential 
centrifugation. Once the total RNA is isolated, mRNA is 
isolated by making use of the adenine nucleotide residues 
known to those skilled in the art as a poly(A) tail found on 
virtually every eukaryotic mRNA molecule at the 3' end 
thereof. Oligonucleotides composed of only deoxythymidine 
[oligo(dT)] are linked to cellulose and the oligo(dT)-cellulose 
is packed into small columns. When a preparation of 
total cellular RNA is passed through such a column, the mRNA 
molecules bind to the oligo(dT) by the poly(A) tails while the 
rest of the RNA flows through the column. The bound mRNAs 
are then eluted from the column and collected. 

One example of detecting isolated mRNA transcribed from 
a colon specific gene of the present invention or a fragment 
thereof which encodes for a polypeptide of the present 
invention comprises screening the collected mRNAs with 
specific oligonucleotide probes which have been custom 
designed to hybridize to the mRNA to be detected. The 
oligonucleotide probe comprises a polynucleotide sequence 
which hybridizes to at least a portion of the mRNA (or cDNA 
produced from such RNA) transcribed from one or more of the 
colon specific genes of the present invention or fragment 
thereof. The polynucleotide sequences are at least 70% 
identical to and hybridize to mRNA (or cDNA produced from 
such RNA) transcribed from a human colon specific gene of the 
present invention having exons which include DNA having at 
least 90%, preferably at least 95% identity and most 
preferably at least 97% identity to the DNA sequence of 
Figure 1 (SEQ ID NO:1). 

It is also appreciated that such probes can be and are 
preferably labeled with an analytically detectable reagent to 
facilitate identification of the probe. Useful reagents 
include but are not limited to radioactivity, fluorescent 
dyes or enzymes capable of catalyzing the formation of a 
detectable product. 

An example of detecting a polynucleotide complementary 
to the mRNA sequence (cDNA) utilizes the polymerase chain 
reaction (PCR) in conjunction with reverse transcriptase. 
PCR is a very powerful method for the specific amplification 
of DNA or RNA stretches (Saiki et al., Nature, 324:163-166 
(1986)). One application of this technology is in nucleic 
acid probe technology to bring up nucleic acid sequences 
present in low copy numbers to a detectable level. Numerous 
diagnostic and scientific applications of this method have 
been described by H.A. Erlich (ed.) in PCR Technology: 
Principles and Applications for DNA Amplification, Stockton 
Press, USA, 1989, and by M.A. Innis (ed.) in PCR Protocols, 
Academic Press, San Diego, USA, 1990. 

RT-PCR is a combination of PCR with the reverse 
transcriptase enzyme. Reverse transcriptase is an enzyme 
which produces cDNA molecules from corresponding mRNA 
molecules. This is important since PCR amplifies nucleic 
acid molecules, particularly DNA, and this DNA may be 
produced from the mRNA isolated from a sample derived from 
the host. 

A specific example of an RT-PCR diagnostic assay 
involves removing a sample from a tissue of a host. Such a 
sample will be from a tissue, other than the colon, for 
example, blood. Therefore, an example of such a diagnostic 
assay comprises whole blood gradient isolation of nucleated 
cells, total RNA extraction, RT-PCR of total RNA and agarose 
gel electrophoresis of PCR products. The PCR products 
comprise cDNA complementary to RNA transcribed from the colon 
specific gene of the present invention or fragments thereof. 
More particularly, a blood sample is obtained and the whole 
blood is combined with an equal volume of phosphate buffered 
saline, centrifuged and the lymphocyte and granulocyte layer 
is carefully aspirated and rediluted in phosphate buffered 
saline and centrifuged again. The supernate is discarded and 
the pellet containing nucleated cells is used for RNA 
extraction using the RNAzol B method as described by the 
manufacturer (Tel-Test Inc., Friendswood, TX). 

Oligonucleotide primers and probes are prepared with 
high specificity to the DNA sequences of the present 
invention. The probes are at least 10 base pairs in length, 
preferably at least 30 base pairs in length and may be at 
least 50 base pairs in length or more. The reverse 
transcriptase reaction and PCR amplification are performed 
sequentially without interruption. Taq polymerase is used 
during PCR and the PCR products are concentrated and the 
entire sample is run on a Tris-borate-EDTA agarose gel 
containing ethidium bromide. 

In accordance with another aspect of the present 
invention, there is provided a method of diagnosing a 
disorder of the colon, for example colon cancer, by 
determining altered levels of the colon specific polypeptides 
of the present invention in a biological sample derived from 
tissue other than from the colon. Elevated levels of the 
colon specific polypeptides of the present invention 
indicate active transcription and expression of the 
corresponding colon specific gene product. Assays used to 
detect levels of a colon specific gene polypeptide in a 
sample derived from a host are well-known to those skilled in 
the art and include radioimmunoassays, competitive-binding 
assays, Western blot analysis, ELISA assays and "sandwich" 
assays. A biological sample may include, but is not limited 
to, tissue extracts, cell samples or biological fluids; 
however, in accordance with the present invention, a 
biological sample specifically does not include tissue or 
cells of the colon. 

An ELISA assay (Coligan, et al., Current Protocols in 
Immunology, 1(2), Chapter 6, 1991) initially comprises 
preparing an antibody specific to a colon specific 
polypeptide of the present invention, preferably a monoclonal 
antibody. In addition, a reporter antibody is prepared 
against the monoclonal antibody. To the reporter antibody is 
attached a detectable reagent such as radioactivity, 
fluorescence or, in this example, a horseradish peroxidase 
enzyme. A sample is removed from a host and incubated on a 
solid support, e.g., a polystyrene dish, that binds the 
proteins in the sample. Any free protein binding sites on 
the dish are then covered by incubating with a non-specific 
protein, such as BSA. Next, the monoclonal antibody is 
incubated in the dish during which time the monoclonal 
antibodies attach to the colon specific polypeptide attached 
to the polystyrene dish. All unbound monoclonal antibody is 
washed out with buffer. The reporter antibody linked to 
horseradish peroxidase is now placed in the dish resulting in 
binding of the reporter antibody to any monoclonal antibody 
bound to the colon specific gene polypeptide. Unattached 
reporter antibody is then washed out. Peroxidase substrates 
are then added to the dish and the amount of color developed 
in a given time period is a measurement of the amount of the 
colon specific polypeptide present in a given volume of 
patient sample when compared against a standard curve. 
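
The final step above, reading a sample's color development against a standard curve, can be sketched in Python as a simple interpolation. This is a minimal illustration under stated assumptions: the standard concentrations and absorbance readings are invented placeholder values, the function name is hypothetical, and piecewise-linear interpolation is assumed in place of whatever curve-fitting a given laboratory would actually use.

```python
from bisect import bisect_left


def concentration_from_absorbance(absorbance, standards):
    """Linearly interpolate a concentration from a standard curve.

    standards: (concentration, absorbance) pairs with distinct absorbance
    values; the signal is assumed to rise with concentration.
    """
    points = sorted(standards, key=lambda p: p[1])
    signals = [s for _, s in points]
    if any(s2 <= s1 for s1, s2 in zip(signals, signals[1:])):
        raise ValueError("standard absorbances must be distinct")
    if not signals[0] <= absorbance <= signals[-1]:
        raise ValueError("absorbance outside the range of the standard curve")
    i = bisect_left(signals, absorbance)
    if signals[i] == absorbance:
        return points[i][0]
    (c0, s0), (c1, s1) = points[i - 1], points[i]
    return c0 + (c1 - c0) * (absorbance - s0) / (s1 - s0)


# Hypothetical standards: (concentration in ng/mL, absorbance reading).
STANDARDS = [(0.0, 0.05), (10.0, 0.25), (20.0, 0.45), (40.0, 0.85)]
print(round(concentration_from_absorbance(0.35, STANDARDS), 3))
```

Readings outside the range of the standards are rejected rather than extrapolated, since the curve is only supported by measurement between its lowest and highest standard.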

A competition assay may be employed where antibodies 
specific to a colon specific polypeptide are attached to a 
solid support. The colon specific polypeptide is then 
labeled and the labeled polypeptide and a sample derived from the 
host are passed over the solid support and the amount of 
label detected, for example, by liquid scintillation 
chromatography, can be correlated to a quantity of the colon 
specific polypeptide in the sample. 

A "sandwich" assay is similar to an ELISA assay. In a 
"sandwich" assay, colon specific polypeptides are passed over 
a solid support and bind to antibody attached to the solid 
support. A second antibody is then bound to the colon 
specific polypeptide. A third antibody which is labeled and 
is specific to the second antibody, is then passed over the 
solid support and binds to the second antibody and an amount 
can then be quantified. 

In alternative methods, labeled antibodies to a colon 
specific polypeptide are used. In a one-step assay, the 
target molecule, if it is present, is immobilized and 
incubated with a labeled antibody. The labeled antibody 
binds to the immobilized target molecule. After washing to 
remove the unbound molecules, the sample is assayed for the 
presence of the label. In a two-step assay, immobilized 
target molecule is incubated with an unlabeled antibody. The 
target molecule-unlabeled antibody complex, if present, is then 
bound to a second, labeled antibody that is specific for the 
unlabeled antibody. The sample is washed and assayed for the 
presence of the label. 

The choice of marker used to label the antibodies will 
vary depending upon the application. However, the choice of 
marker is readily determinable to one skilled in the art. 
These labeled antibodies may be used in immunoassays as well 
as in histological applications to detect the presence of the 
proteins. The labeled antibodies may be polyclonal or 
monoclonal. 

The presence of active transcription, which is greater 
than that normally found, of the colon specific gene in cells 
other than from the colon, by the presence of an altered 
level of mRNA, cDNA or expression products is an important 
indication of the presence of a colon cancer which has 
metastasized, since colon cancer cells are migrating from the 
colon into the general circulation. Accordingly, this 
phenomenon may have important clinical implications since the 
method of treating a localized, as opposed to a metastasized, 
tumor is entirely different. 

The assays described above may also be used to test 
whether bone marrow preserved before chemotherapy is 
contaminated with micrometastases of a colon cancer cell. In 
the assay, blood cells from the bone marrow are isolated and 
treated as described above. This method allows one to 
determine whether preserved bone marrow is still suitable for 
transplantation after chemotherapy. 

Antibodies specific to the colon specific polypeptide, 
for example monoclonal antibodies, may also be used to target 
colon cancer cells, for example, in a method of homing 
interaction agents which, when contacting colon cancer cells, 
destroy them. This is true since the antibodies are specific 
for colon specific polypeptides which are primarily expressed 
in the colon, and a linking of the interaction agent to the 
antibody would cause the interaction agent to be carried 
directly to the colon. 

Antibodies of this type may also be used to do in vivo
imaging, for example, by labeling the antibodies to
facilitate scanning of the pelvic area and the colon. One
method for imaging comprises contacting any tumor cells of
the colon to be imaged with an anti-colon specific antibody
labeled with a detectable marker. The method is performed
under conditions such that the labeled antibody binds to any
colon specific polypeptides. In a specific example, the
antibodies interact with the colon, for example, colon cancer
cells, and fluoresce upon such contact such that imaging and
visibility of the colon is enhanced to allow a determination
of the diseased or non-diseased state of the colon.

The present invention further relates to a colon
specific gene polypeptide which has the deduced amino acid
sequence of Figure 1 (SEQ ID NO: 2) or which has the amino
acid sequence encoded by the deposited cDNA(s), as well as
fragments, analogs and derivatives of such polypeptide.

The terms "fragment," "derivative" and "analog" when
referring to the polypeptide of Figure 1 (SEQ ID NO: 2) or
that encoded by the deposited cDNA(s), mean a polypeptide
which retains essentially the same biological function or
activity as such polypeptide. Thus, an analog includes a
proprotein which can be activated by cleavage of the
proprotein portion to produce an active mature polypeptide.

The polypeptides of the present invention may be
recombinant polypeptides, natural polypeptides or synthetic 
polypeptides, preferably recombinant polypeptides. 

The fragment, derivative or analog of the polypeptides 
of Figure 1 (SEQ ID NO: 2) or that encoded by the deposited 
cDNA(s) may be (i) one in which one or more of the amino acid
residues are substituted with a conserved or non-conserved
amino acid residue (preferably a conserved amino acid
residue) and such substituted amino acid residue may or may
not be one encoded by the genetic code, or (ii) one in which
one or more of the amino acid residues includes a substituent
group, or (iii) one in which the polypeptide is fused with
another compound, such as a compound to increase the
half-life of the polypeptide (for example, polyethylene glycol),
or (iv) one in which the additional amino acids are fused to
the polypeptide, such as a leader or secretory sequence or a
sequence which is employed for purification of the mature
polypeptide or a proprotein sequence. Such fragments,
derivatives and analogs are deemed to be within the scope of
those skilled in the art from the teachings herein.

The polypeptides and polynucleotides of the present
invention are preferably provided in an isolated form, and
preferably are purified to homogeneity.

The term "isolated" means that the material is removed
from its original environment (e.g., the natural environment
if it is naturally occurring). For example, a naturally
occurring polynucleotide or polypeptide present in a living
animal is not isolated, but the same polynucleotide or
polypeptide, separated from some or all of the coexisting
materials in the natural system, is isolated. Such
polynucleotides could be part of a vector and/or such
polynucleotides or polypeptides could be part of a
composition, and still be isolated in that such vector or
composition is not part of its natural environment.

The polypeptides of the present invention include the 
polypeptide of Figure 1 (SEQ ID NO: 2) (in particular the
mature polypeptide) as well as polypeptides which have at
least 70% similarity (preferably at least 70% identity) to
the polypeptide of Figure 1 (SEQ ID NO: 2) and more preferably
at least 90% similarity (more preferably at least 90%
identity) to the polypeptide of Figure 1 (SEQ ID NO: 2) and
still more preferably at least 95% similarity (still more
preferably at least 90% identity) to the polypeptide of
Figure 1 (SEQ ID NO: 2) and also include portions of such
polypeptides with such portion of the polypeptide generally 
containing at least 30 amino acids and more preferably at 
least 50 amino acids. 

As known in the art, "similarity" between two
polypeptides is determined by comparing the amino acid 
sequence and its conserved amino acid substitutes of one 
polypeptide to the sequence of a second polypeptide. 
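
The comparison described above can be illustrated with a short
Python sketch. It assumes the two sequences have already been
aligned to equal length, and the grouping of "conserved"
substitutions is a common textbook grouping chosen for
illustration only; the specification does not define one. The
function name is hypothetical.

```python
# Illustrative only: these conservative-substitution groups are an
# assumption, not a definition taken from the specification.
CONSERVED_GROUPS = [
    set("AVLIM"),  # aliphatic / hydrophobic
    set("FWY"),    # aromatic
    set("KRH"),    # basic
    set("DE"),     # acidic
    set("STNQ"),   # polar, uncharged
]


def percent_identity_and_similarity(seq_a, seq_b):
    """Compare two pre-aligned, equal-length amino acid sequences.

    Returns (percent identity, percent similarity). A position counts
    as similar if the residues are identical or fall in the same
    conserved group.
    """
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be aligned to the same non-zero length")
    identical = 0
    similar = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == b:
            identical += 1
            similar += 1
        elif any(a in group and b in group for group in CONSERVED_GROUPS):
            similar += 1
    n = len(seq_a)
    return 100.0 * identical / n, 100.0 * similar / n
```

Because every identical position also counts as similar, the
similarity figure is never lower than the identity figure, which
is consistent with the paired thresholds recited above.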

Fragments or portions of the polypeptides of the present
invention may be employed for producing the corresponding
full-length polypeptide by peptide synthesis; therefore, the
fragments may be employed as intermediates for producing the
full-length polypeptides. Fragments or portions of the
polynucleotides of the present invention may be used to
synthesize full-length polynucleotides of the present
invention.

The present invention also relates to vectors which 
include polynucleotides of the present invention, host cells 
which are genetically engineered with vectors of the 
invention and the production of polypeptides of the invention 
by recombinant techniques. 

Host cells are genetically engineered (transduced or
transformed or transfected) with the vectors of this
invention which may be, for example, a cloning vector or an
expression vector. The vector may be, for example, in the
form of a plasmid, a viral particle, a phage, etc. The
engineered host cells can be cultured in conventional
nutrient media modified as appropriate for activating
promoters, selecting transformants or amplifying the colon
specific genes. The culture conditions, such as temperature,
pH and the like, are those previously used with the host cell
selected for expression, and will be apparent to those of
ordinary skill in the art.

The polynucleotides of the present invention may be
employed for producing polypeptides by recombinant
techniques. Thus, for example, the polynucleotide may be
included in any one of a variety of expression vectors for
expressing a polypeptide. Such vectors include chromosomal,
nonchromosomal and synthetic DNA sequences, e.g., derivatives
of SV40; bacterial plasmids; phage DNA; baculovirus; yeast
plasmids; vectors derived from combinations of plasmids and
phage DNA; viral DNA such as vaccinia, adenovirus, fowl pox
virus, and pseudorabies. However, any other vector may be
used as long as it is replicable and viable in the host.

The appropriate DNA sequence may be inserted into the
vector by a variety of procedures. In general, the DNA
sequence is inserted into an appropriate restriction
endonuclease site(s) by procedures known in the art. Such
procedures and others are deemed to be within the scope of
those skilled in the art.
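
As a rough illustration of locating candidate insertion sites,
the Python sketch below scans a vector sequence for two
well-known recognition sequences (EcoRI, GAATTC; BamHI, GGATCC)
and reports the enzymes that cut exactly once. The helper names
and the idea of preferring single cutters are assumptions made
for the example; the specification does not prescribe a
particular enzyme or selection rule.

```python
# Recognition sequences of two widely used restriction endonucleases.
# The helper names below are hypothetical.
RECOGNITION_SITES = {"EcoRI": "GAATTC", "BamHI": "GGATCC"}


def find_sites(dna, sites=RECOGNITION_SITES):
    """Return {enzyme: [0-based start positions]} on the given strand."""
    dna = dna.upper()
    hits = {}
    for enzyme, motif in sites.items():
        positions = []
        start = dna.find(motif)
        while start != -1:
            positions.append(start)
            start = dna.find(motif, start + 1)
        hits[enzyme] = positions
    return hits


def single_cutters(dna, sites=RECOGNITION_SITES):
    """Enzymes whose recognition sequence occurs exactly once."""
    return sorted(e for e, pos in find_sites(dna, sites).items() if len(pos) == 1)
```

Both motifs used here are palindromic, so scanning a single
strand is sufficient for them; a non-palindromic motif would
also need a search of the reverse complement.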

The DNA sequence in the expression vector is operatively
linked to an appropriate expression control sequence(s)
(promoter) to direct mRNA synthesis. As representative
examples of such promoters, there may be mentioned: LTR or
SV40 promoter, the E. coli lac or trp, the phage lambda PL
promoter and other promoters known to control expression of
genes in prokaryotic or eukaryotic cells or their viruses.
The expression vector also contains a ribosome binding site
for translation initiation and a transcription terminator.
The vector may also include appropriate sequences for
amplifying expression.

In addition, the expression vectors preferably contain
one or more selectable marker genes to provide a phenotypic
trait for selection of transformed host cells such as
dihydrofolate reductase or neomycin resistance for eukaryotic
cell culture, or such as tetracycline or ampicillin
resistance in E. coli.

The vector containing the appropriate DNA sequence as 
hereinabove described, as well as an appropriate promoter or 
control sequence, may be employed to transform an appropriate
host to permit the host to express the protein. 

As representative examples of appropriate hosts, there
may be mentioned: bacterial cells, such as E. coli,
Streptomyces, Salmonella typhimurium; fungal cells, such as
yeast; insect cells such as Drosophila S2 and Spodoptera Sf9;
animal cells such as CHO, COS or Bowes melanoma;
adenoviruses; plant cells, etc. The selection of an
appropriate host is deemed to be within the scope of those
skilled in the art from the teachings herein.

More particularly, the present invention also includes
recombinant constructs comprising one or more of the
sequences as broadly described above. The constructs
comprise a vector, such as a plasmid or viral vector, into
which a sequence of the invention has been inserted, in a
forward or reverse orientation. In a preferred aspect of
this embodiment, the construct further comprises regulatory
sequences, including, for example, a promoter, operably
linked to the sequence. Large numbers of suitable vectors
and promoters are known to those of skill in the art, and are
commercially available. The following vectors are provided
by way of example. Bacterial: pQE70, pQE60, pQE-9 (Qiagen),
pBS, pD10, phagescript, psiX174, pbluescript SK, pBSKS,
pNH8A, pNH16a, pNH18A, pNH46A (Stratagene); ptrc99a,
pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia). Eukaryotic:
pWLNEO, pSV2CAT, pOG44, pXT1, pSG (Stratagene) pSVK3, pBPV,
pMSG, pSVL (Pharmacia). However, any other plasmid or vector
may be used as long as it is replicable and viable in the
host.

Promoter regions can be selected from any desired gene
using CAT (chloramphenicol transferase) vectors or other
vectors with selectable markers. Two appropriate vectors are
pKK232-8 and pCM7. Particular named bacterial promoters
include lacI, lacZ, T3, T7, gpt, lambda PR, PL and trp.
Eukaryotic promoters include CMV immediate early, HSV
thymidine kinase, early and late SV40, LTRs from retrovirus,
and mouse metallothionein-I. Selection of the appropriate
vector and promoter is well within the level of ordinary
skill in the art.

In a further embodiment, the present invention relates
to host cells containing the above-described constructs. The
host cell can be a higher eukaryotic cell, such as a
mammalian cell, or a lower eukaryotic cell, such as a yeast
cell, or the host cell can be a prokaryotic cell, such as a
bacterial cell. Introduction of the construct into the host
cell can be effected by calcium phosphate transfection,
DEAE-Dextran mediated transfection, or electroporation (Davis,
L., Dibner, M., Battey, I., Basic Methods in Molecular Biology,
(1986)).

The constructs in host cells can be used in a 
conventional manner to produce the gene product encoded by 
the recombinant sequence. Alternatively, the polypeptides of 
the invention can be synthetically produced by conventional 
peptide synthesizers. 

Proteins can be expressed in mammalian cells, yeast,
bacteria, or other cells under the control of appropriate
promoters. Cell-free translation systems can also be
employed to produce such proteins using RNAs derived from the
DNA constructs of the present invention. Appropriate cloning
and expression vectors for use with prokaryotic and
eukaryotic hosts are described by Sambrook, et al., Molecular
Cloning: A Laboratory Manual, Second Edition, Cold Spring
Harbor, N.Y., (1989), the disclosure of which is hereby
incorporated by reference.

Transcription of the DNA encoding the polypeptides of
the present invention by higher eukaryotes is increased by
inserting an enhancer sequence into the vector. Enhancers
are cis-acting elements of DNA, usually about from 10 to 300
bp, that act on a promoter to increase its transcription.
Examples include the SV40 enhancer on the late side of the
replication origin bp 100 to 270, a cytomegalovirus early
promoter enhancer, the polyoma enhancer on the late side of
the replication origin, and adenovirus enhancers.

Generally, recombinant expression vectors will include
origins of replication and selectable markers permitting
transformation of the host cell, e.g., the ampicillin
resistance gene of E. coli and S. cerevisiae TRP1 gene, and
a promoter derived from a highly-expressed gene to direct
transcription of a downstream structural sequence. Such
promoters can be derived from operons encoding glycolytic
enzymes such as 3-phosphoglycerate kinase (PGK), α-factor,
acid phosphatase, or heat shock proteins, among others. The
heterologous structural sequence is assembled in appropriate
phase with translation initiation and termination sequences.
Optionally, the heterologous sequence can encode a fusion
protein including an N-terminal identification peptide
imparting desired characteristics, e.g., stabilization or
simplified purification of expressed recombinant product.
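
The requirement that the structural sequence be "in appropriate
phase" with the initiation and termination signals can be
checked mechanically. The Python sketch below is a minimal
illustration, assuming the insert is given as a DNA coding
strand that should begin with an ATG start codon and end with
one of the three standard stop codons; the function name is
hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def is_in_frame_orf(dna):
    """True if dna reads ATG ... stop as whole codons with no early stop."""
    dna = dna.upper()
    # Need at least a start and a stop codon, and a whole number of codons.
    if len(dna) < 6 or len(dna) % 3 != 0:
        return False
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] != "ATG":
        return False
    if codons[-1] not in STOP_CODONS:
        return False
    # No premature stop codon between the start and the final stop.
    return not any(codon in STOP_CODONS for codon in codons[1:-1])
```

A single inserted or deleted base shifts the frame, so the check
fails on length alone in that case, before any codon is
inspected.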

Useful expression vectors for bacterial use are
constructed by inserting a structural DNA sequence encoding
a desired protein together with suitable translation
initiation and termination signals in operable reading phase
with a functional promoter. The vector will comprise one or
more phenotypic selectable markers and an origin of
replication to ensure maintenance of the vector and to, if
desirable, provide amplification within the host. Suitable
prokaryotic hosts for transformation include E. coli,
Bacillus subtilis, Salmonella typhimurium and various species
within the genera Pseudomonas, Streptomyces, and
Staphylococcus, although others may also be employed as a
matter of choice.

As a representative but nonlimiting example, useful
expression vectors for bacterial use can comprise a
selectable marker and bacterial origin of replication derived
from commercially available plasmids comprising genetic
elements of the well known cloning vector pBR322 (ATCC
37017). Such commercial vectors include, for example,
pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM1
(Promega Biotec, Madison, WI, USA). These pBR322 "backbone"
sections are combined with an appropriate promoter and the
structural sequence to be expressed.

Following transformation of a suitable host strain and 
growth of the host strain to an appropriate cell density, the 
selected promoter is induced by appropriate means (e.g., 
temperature shift or chemical induction) and cells are 
cultured for an additional period. 

Cells are typically harvested by centrifugation, 
disrupted by physical or chemical means, and the resulting
crude extract retained for further purification. 

Microbial cells employed in expression of proteins can
be disrupted by any convenient method, including freeze-thaw
cycling, sonication, mechanical disruption, or use of cell
lysing agents; such methods are well known to those skilled in
the art.

Various mammalian cell culture systems can also be
employed to express recombinant protein. Examples of
mammalian expression systems include the COS-7 lines of
monkey kidney fibroblasts, described by Gluzman, Cell, 23:175
(1981), and other cell lines capable of expressing a
compatible vector, for example, the C127, 3T3, CHO, HeLa and
BHK cell lines. Mammalian expression vectors will comprise
an origin of replication, a suitable promoter and enhancer,
and also any necessary ribosome binding sites,
polyadenylation site, splice donor and acceptor sites,
transcriptional termination sequences, and 5' flanking
nontranscribed sequences. DNA sequences derived from the
SV40 splice and polyadenylation sites may be used to provide
the required nontranscribed genetic elements.

The colon specific gene polypeptide can be recovered and 
purified from recombinant cell cultures by methods including 
ammonium sulfate or ethanol precipitation, acid extraction, 
anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, 
affinity chromatography, hydroxylapatite chromatography and 
lectin chromatography. Protein refolding steps can be used, 
as necessary, in completing configuration of the mature
protein. Finally, high performance liquid chromatography
(HPLC) can be employed for final purification steps.

The polynucleotides of the present invention may have
the coding sequence fused in frame to a marker sequence which
allows for purification of the polypeptide of the present
invention. An example of a marker sequence is a
hexa-histidine tag which may be supplied by a vector, preferably
a pQE-9 vector, which provides for purification of the
polypeptide fused to the marker in the case of a bacterial
host, or, for example, the marker sequence may be a
hemagglutinin (HA) tag when a mammalian host, e.g., COS-7
cells, is used. The HA tag corresponds to an epitope derived
from the influenza hemagglutinin protein (Wilson, I., et al.,
Cell, 37:767 (1984)).

The polypeptide of the present invention may be a
naturally purified product, or a product of chemical
synthetic procedures, or produced by recombinant techniques
from a prokaryotic or eukaryotic host (for example, by
bacterial, yeast, higher plant, insect and mammalian cells in
culture). Depending upon the host employed in a recombinant
production procedure, the polypeptide of the present
invention may be glycosylated or may be non-glycosylated.
Polypeptides of the invention may also include an initial
methionine amino acid residue.

In accordance with another aspect of the present
invention there are provided assays which may be used to
screen for therapeutics to inhibit the action of the colon
specific gene or colon specific protein of the present
invention. One assay takes advantage of the reductase
function of these proteins. The present invention discloses
methods for selecting a therapeutic which forms a complex
with colon specific gene proteins with sufficient affinity to
prevent their biological action. The methods include various
assays, including competitive assays where the proteins are
immobilized to a support, and are contacted with a natural
substrate and a labeled therapeutic either simultaneously or
in either consecutive order, and determining whether the
therapeutic effectively competes with the natural substrate
in a manner sufficient to prevent binding of the protein to
its substrate.

In another embodiment, the substrate is immobilized to
a support, and is contacted with both a labeled colon
specific polypeptide and a therapeutic (or unlabeled proteins
and a labeled therapeutic), and it is determined whether the
amount of the colon specific polypeptide bound to the
substrate is reduced in comparison to the assay without the
therapeutic added. The colon specific polypeptide may be
labeled with antibodies.
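
The comparison between the assay with and without the
therapeutic reduces to simple arithmetic on the two bound-label
readings. The Python sketch below is illustrative only; the
function name and the use of raw signal counts are assumptions,
and the specification does not fix a threshold for a meaningful
reduction.

```python
def percent_binding_reduction(signal_without, signal_with):
    """Percent drop in bound label when the candidate therapeutic is added.

    signal_without: bound-label reading from the control assay
    signal_with:    bound-label reading with the therapeutic present
    """
    if signal_without <= 0:
        raise ValueError("control signal must be positive")
    return 100.0 * (signal_without - signal_with) / signal_without
```

A negative result would indicate that more polypeptide was bound
in the presence of the compound than in the control.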

In another example of such a screening assay, there is
provided a mammalian cell or membrane preparation expressing
a colon specific polypeptide of the present invention
incubated with elements which undergo simultaneous oxidation
and reduction, for example hydrogen and oxygen which together
form water, wherein the hydrogen could be labeled by
radioactivity, e.g., tritium, in the presence of the compound
to be screened under conditions favoring the oxidation
reduction reaction where hydrogen and oxygen form water. The
ability of the compound to block this interaction could then
be measured.

This invention provides a method for identification of
the receptors for the polypeptide of the present invention.
The gene encoding the receptor can be identified by numerous
methods known to those of skill in the art, for example,
ligand panning and FACS sorting (Coligan, et al., Current
Protocols in Immun., 1(2), Chapter 5, (1991)). Preferably,
expression cloning is employed wherein polyadenylated RNA is
prepared from a cell responsive to the polypeptides, and a
cDNA library created from this RNA is divided into pools and
used to transfect COS cells or other cells that are not
responsive to the polypeptides. Transfected cells which are
grown on glass slides are exposed to the labeled
polypeptides. The polypeptides can be labeled by a variety
of means including iodination or inclusion of a recognition
site for a site-specific protein kinase. Following fixation
and incubation, the slides are subjected to autoradiographic
analysis. Positive pools are identified and sub-pools are
prepared and retransfected using an iterative sub-pooling and
rescreening process, eventually yielding a single clone(s)
that encodes the putative receptor.
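
The iterative sub-pooling and rescreening logic can be sketched
in Python as a repeated narrowing of a positive pool. This is a
schematic model only: the screening step is represented by a
caller-supplied function (here called pool_is_positive, a
hypothetical name standing in for transfection and
autoradiography), and the number of sub-pools per round is an
arbitrary assumption.

```python
def find_positive_clone(clones, pool_is_positive, n_subpools=4):
    """Narrow a positive pool to a single clone by repeated sub-pooling.

    clones:           list of clone identifiers forming a positive pool
    pool_is_positive: callable taking a list of clones, returning True
                      if the pool scores positive in the screen
    Returns (clone, number_of_rounds).
    """
    if n_subpools < 2:
        raise ValueError("need at least two sub-pools per round")
    pool = list(clones)
    if not pool:
        raise ValueError("empty pool")
    rounds = 0
    while len(pool) > 1:
        size = -(-len(pool) // n_subpools)  # ceiling division
        subpools = [pool[i:i + size] for i in range(0, len(pool), size)]
        positives = [sp for sp in subpools if pool_is_positive(sp)]
        if not positives:
            raise ValueError("no sub-pool scored positive")
        pool = positives[0]  # follow the first positive sub-pool
        rounds += 1
    return pool[0], rounds
```

With four sub-pools per round, a pool of 64 clones is resolved
in three rounds of screening rather than 64 individual
transfections.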

As an alternative approach for receptor identification,
the labeled polypeptides can be photoaffinity linked with
cell membrane or extract preparations that express the
receptor molecule. Cross-linked material is resolved by PAGE
analysis and exposed to X-ray film. The labeled complex
containing the receptors of the polypeptides can be excised,
resolved into peptide fragments, and subjected to protein
microsequencing. The amino acid sequence obtained from
microsequencing would be used to design a set of degenerate
oligonucleotide probes to screen a cDNA library to identify
the genes encoding the putative receptors.
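
When designing degenerate probes from a microsequenced peptide,
a practical consideration is how many distinct nucleotide
sequences could encode a given stretch of amino acids. The
Python sketch below counts this using the number of codons per
amino acid in the standard genetic code and picks the least
degenerate window. The function names and the default window
width are illustrative assumptions.

```python
# Number of codons encoding each amino acid in the standard genetic code.
CODON_COUNTS = {
    "M": 1, "W": 1,
    "C": 2, "D": 2, "E": 2, "F": 2, "H": 2, "K": 2, "N": 2, "Q": 2, "Y": 2,
    "I": 3,
    "A": 4, "G": 4, "P": 4, "T": 4, "V": 4,
    "L": 6, "R": 6, "S": 6,
}


def probe_degeneracy(peptide):
    """Number of distinct DNA sequences that could encode the peptide."""
    total = 1
    for aa in peptide.upper():
        total *= CODON_COUNTS[aa]
    return total


def least_degenerate_window(peptide, width=6):
    """Return the first window of `width` residues with the lowest degeneracy."""
    peptide = peptide.upper()
    if len(peptide) < width:
        raise ValueError("peptide shorter than window")
    windows = [peptide[i:i + width] for i in range(len(peptide) - width + 1)]
    return min(windows, key=probe_degeneracy)
```

Stretches rich in methionine and tryptophan give the smallest
probe sets, while leucine, arginine and serine each multiply the
set size by six.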

Further, since the colon specific gene and gene product
of the present invention is a growth regulator, agonists and
antagonists to the polypeptide could be determined by an
assay comprising combining membrane preparations comprising
the receptor for the polypeptide and a compound to be
screened and determining the generation of a signal from the
receptor. In the case of determining an antagonist, the
polypeptide of the present invention is added to the assay
and the ability of the compound to compete for receptor
sites, i.e., lack of generation of signal from receptor,
could then be determined.

Potential antagonists to a colon specific polypeptide
include antibodies and anti-idiotypic antibodies as described
above, or in some cases, an oligonucleotide, which binds to
the polypeptide.

Another potential antagonist is an antisense construct 
prepared using antisense technology, which is directed to a 
colon specific polynucleotide to prevent transcription. 
Antisense technology can be used to control gene expression 
through triple-helix formation or antisense DNA or RNA, both
of which methods are based on binding of a polynucleotide to
DNA or RNA. For example, the 5' coding portion of the
polynucleotide sequence, which encodes the mature
polypeptides of the present invention, is used to design an
antisense RNA oligonucleotide of from about 10 to 40 base
pairs in length. A DNA oligonucleotide is designed to be
complementary to a region of the gene involved in
transcription (triple helix; see Lee et al., Nucl. Acids
Res., 6:3073 (1979); Cooney et al., Science, 241:456 (1988);
and Dervan et al., Science, 251:1360 (1991)), thereby
preventing transcription and the production of a colon
specific polynucleotide. The antisense RNA oligonucleotide
hybridizes to the mRNA in vivo and blocks translation of the
mRNA molecule into the colon specific gene polypeptide
(antisense; Okano, J. Neurochem., 56:560 (1991);
Oligodeoxynucleotides as Antisense Inhibitors of Gene
Expression, CRC Press, Boca Raton, FL (1988)). The
oligonucleotides described above can also be delivered to
cells such that the antisense RNA or DNA may be expressed in
vivo to inhibit production of the colon specific
polypeptides.
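
The design step described above, taking the 5' coding portion
and producing a complementary oligonucleotide of about 10 to 40
bases, can be sketched in Python as a reverse complement. The
function name, the default length of 20, and the choice to start
at the first base of the coding sequence are assumptions for
illustration; the specification gives only the length range.

```python
# Watson-Crick complement for a DNA coding strand.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_oligo(coding_dna, length=20, as_rna=True):
    """Antisense oligonucleotide against the 5' end of a coding sequence.

    The result is written 5'->3' and is complementary to the first
    `length` bases of the coding strand (and hence to the mRNA).
    """
    if not 10 <= length <= 40:
        raise ValueError("length should be from about 10 to 40 bases")
    dna = coding_dna.upper()
    if len(dna) < length:
        raise ValueError("coding sequence shorter than requested oligo")
    target = dna[:length]  # 5' coding portion
    oligo = target.translate(COMPLEMENT)[::-1]  # reverse complement
    return oligo.replace("T", "U") if as_rna else oligo
```

For instance, the ten-base target ATGGCCAAGT yields the
antisense RNA ACUUGGCCAU.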

Potential antagonists also include a small molecule
which binds to and occupies the active site of the colon
specific polypeptide thereby making the active site
inaccessible to substrate such that normal biological
activity is prevented. Examples of small molecules include
but are not limited to small peptides or peptide-like
molecules.

The antagonists may be employed to treat colon cancer,
since they interact with the function of colon specific
polypeptides in a manner sufficient to inhibit natural
function which is necessary for the viability of colon cancer
cells. The antagonists may be employed in a composition with
a pharmaceutically acceptable carrier, e.g., as hereinafter
described.

The polypeptides and antagonists of the present
invention may be employed in combination with a suitable 
pharmaceutical carrier. Such compositions comprise a 
therapeutically effective amount of the polypeptide or 
antagonist, and a pharmaceutically acceptable carrier or 
excipient. Such a carrier includes but is not limited to 
saline, buffered saline, dextrose, water, glycerol, ethanol, 
and combinations thereof. The formulation should suit the 
mode of administration. 

The invention also provides a pharmaceutical pack or kit
comprising one or more containers filled with one or more of
the ingredients of the pharmaceutical compositions of the
invention. Associated with such container(s) can be a notice
in the form prescribed by a governmental agency regulating
the manufacture, use or sale of pharmaceuticals or biological
products, which notice reflects approval by the agency of
manufacture, use or sale for human administration. In
addition, the pharmaceutical compositions may be employed in
conjunction with other therapeutic compounds.

The pharmaceutical compositions may be administered in
a convenient manner such as by the oral, topical,
intravenous, intraperitoneal, intramuscular, subcutaneous,
intranasal, intra-anal or intradermal routes. The
pharmaceutical compositions are administered in an amount
which is effective for treating and/or prophylaxis of the
specific indication. In general, they are administered in an
amount of at least about 10 µg/kg body weight and in most
cases they will be administered in an amount not in excess of
about 8 mg/kg body weight per day. In most cases, the dosage
is from about 10 µg/kg to about 1 mg/kg body weight daily,
taking into account the routes of administration, symptoms,
etc.
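
The weight-based ranges stated above translate into a simple
calculation. The Python sketch below only illustrates that
arithmetic, using the stated bounds of about 10 µg/kg and about
8 mg/kg per day; the function and constant names are
hypothetical, and treating the approximate bounds as hard limits
is a simplification made for the example.

```python
MIN_DOSE_MG_PER_KG = 0.010  # about 10 micrograms per kg body weight
MAX_DOSE_MG_PER_KG = 8.0    # about 8 mg per kg body weight per day


def daily_dose_mg(body_weight_kg, dose_mg_per_kg):
    """Total daily amount in mg for a given weight-based dose."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    if not MIN_DOSE_MG_PER_KG <= dose_mg_per_kg <= MAX_DOSE_MG_PER_KG:
        raise ValueError("dose outside the range stated in the text")
    return body_weight_kg * dose_mg_per_kg
```

For a body weight of 70 kg, the range of about 10 µg/kg to about
1 mg/kg corresponds to roughly 0.7 mg to 70 mg daily.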

The colon specific gene polypeptides and antagonists and
agonists which are polypeptides may also be employed in
accordance with the present invention by expression of such
polypeptides in vivo, which is often referred to as "gene
therapy."

Thus, for example, cells from a patient may be
engineered with a polynucleotide (DNA or RNA) encoding a
polypeptide ex vivo, with the engineered cells then being
provided to a patient to be treated with the polypeptide.
Such methods are well-known in the art. For example, cells
may be engineered by procedures known in the art by use of a
retroviral particle containing RNA encoding a polypeptide of
the present invention.

Similarly, cells may be engineered in vivo for
expression of a polypeptide in vivo by, for example,
procedures known in the art. As known in the art, a producer
cell for producing a retroviral particle containing RNA
encoding a polypeptide of the present invention may be
administered to a patient for engineering cells in vivo and
expression of the polypeptide in vivo. These and other
methods for administering a polypeptide of the present
invention by such method should be apparent to those skilled
in the art from the teachings of the present invention. For
example, the expression vehicle for engineering cells may be
other than a retrovirus, for example, an adenovirus which may
be used to engineer cells in vivo after combination with a
suitable delivery vehicle.

Retroviruses from which the retroviral plasmid vectors 
hereinabove mentioned may be derived include, but are not 
limited to, Moloney Murine Leukemia Virus, spleen necrosis 
virus, retroviruses such as Rous Sarcoma Virus, Harvey 
Sarcoma Virus, avian leukosis virus, gibbon ape leukemia 
virus, human immunodeficiency virus, adenovirus,
Myeloproliferative Sarcoma Virus, and mammary tumor virus.
In one embodiment, the retroviral plasmid vector is derived 
from Moloney Murine Leukemia Virus. 

The vector includes one or more promoters. Suitable
promoters which may be employed include, but are not limited
to, the retroviral LTR; the SV40 promoter; and the human
cytomegalovirus (CMV) promoter described in Miller, et al.,
Biotechniques, Vol. 7, No. 9, 980-990 (1989), or any other
promoter (e.g., cellular promoters such as eukaryotic
cellular promoters including, but not limited to, the
histone, pol III, and β-actin promoters). Other viral
promoters which may be employed include, but are not limited
to, adenovirus promoters, thymidine kinase (TK) promoters,
and B19 parvovirus promoters. The selection of a suitable
promoter will be apparent to those skilled in the art from
the teachings contained herein.

The nucleic acid sequence encoding the polypeptide of
the present invention is under the control of a suitable
promoter. Suitable promoters which may be employed include,
but are not limited to, adenoviral promoters, such as the
adenoviral major late promoter; or heterologous promoters,
such as the cytomegalovirus (CMV) promoter; the respiratory
syncytial virus (RSV) promoter; inducible promoters, such as
the MMT promoter, the metallothionein promoter; heat shock
promoters; the albumin promoter; the ApoAI promoter; human
globin promoters; viral thymidine kinase promoters, such as
the Herpes Simplex thymidine kinase promoter; retroviral LTRs
(including the modified retroviral LTRs hereinabove
described); the β-actin promoter; and human growth hormone
promoters. The promoter also may be the native promoter
which controls the gene encoding the polypeptides.

The retroviral plasmid vector is employed to transduce
packaging cell lines to form producer cell lines. Examples
of packaging cells which may be transfected include, but are
not limited to, the PE501, PA317, ψ-2, ψ-AM, PA12, T19-14X,
VT-19-17-H2, ψCRE, ψCRIP, GP+E-86, GP+envAm12, and DAN cell
lines as described in Miller, Human Gene Therapy, Vol. 1,
pgs. 5-14 (1990), which is incorporated herein by reference
in its entirety. The vector may transduce the packaging
cells through any means known in the art. Such means
include, but are not limited to, electroporation, the use of
liposomes, and CaPO4 precipitation. In one alternative, the
retroviral plasmid vector may be encapsulated into a
liposome, or coupled to a lipid, and then administered to a
host.

The producer cell line generates infectious retroviral
vector particles which include the nucleic acid sequence(s)
encoding the polypeptides. Such retroviral vector particles
then may be employed to transduce eukaryotic cells, either
in vitro or in vivo. The transduced eukaryotic cells will
express the nucleic acid sequence(s) encoding the
polypeptide. Eukaryotic cells which may be transduced
include, but are not limited to, embryonic stem cells,
embryonic carcinoma cells, as well as hematopoietic stem
cells, hepatocytes, fibroblasts, myoblasts, keratinocytes,
endothelial cells, and bronchial epithelial cells.

This invention is also related to the use of a colon 
specific gene of the present invention as a diagnostic. For 
example, some diseases result from inherited defective genes. 
The colon specific gene of the present invention is 
overexpressed in colon cancer. A mutation in a colon 
specific gene of the present invention at the DNA level may 
be detected by a variety of techniques. Nucleic acids used 
for diagnosis (genomic DNA, mRNA, etc.) may be obtained from 
a patient's cells, other than from the colon, such as from 
blood, urine, saliva, tissue biopsy and autopsy material. 
The genomic DNA may be used directly for detection or may be 
amplified enzymatically by using PCR (Saiki, et al., Nature, 
324:163-166 (1986)) prior to analysis. RNA or cDNA may also 
be used for the same purpose. As an example, PCR primers 
complementary to the nucleic acid of the instant invention 
can be used to identify and analyze mutations in a colon 
specific polynucleotide of the present invention. For 
example, deletions and insertions can be detected by a change 
in size of the amplified product in comparison to the normal 
genotype. Point mutations can be identified by hybridizing 
amplified DNA to radiolabelled colon specific RNA or, 
alternatively, radiolabelled antisense DNA sequences. 
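
The size comparison described above can be illustrated with a 
minimal sketch. The function name, the tolerance parameter and 
the amplicon lengths below are hypothetical and are not taken 
from the specification; the sketch only shows how a change in 
amplified product size relative to the normal genotype may be 
read as a deletion or an insertion. 

```python
def classify_amplicon(observed_bp: int, reference_bp: int, tolerance_bp: int = 0) -> str:
    """Compare an amplified product size with the normal genotype size.

    tolerance_bp is a hypothetical allowance for gel sizing error.
    """
    difference = observed_bp - reference_bp
    if abs(difference) <= tolerance_bp:
        return "no size change"
    if difference > 0:
        return f"insertion of about {difference} bp"
    return f"deletion of about {-difference} bp"


if __name__ == "__main__":
    reference = 300  # hypothetical normal amplicon length in bp
    for observed in (300, 312, 285):
        print(observed, classify_amplicon(observed, reference))
```

A point mutation leaves the product size unchanged, which is 
why the hybridization approach is described separately for 
that case. 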

Another well-established method for screening for 
mutations in particular segments of DNA after PCR 
amplification is single-strand conformation polymorphism 
(SSCP) analysis. PCR products are prepared for SSCP by ten 
cycles of reamplification to incorporate 32P-dCTP, digested 
with an appropriate restriction enzyme to generate 200-300 bp 
fragments, and denatured by heating to 85°C for 5 min. and 
then plunged into ice. Electrophoresis is then carried out 
in a nondenaturing gel (5% glycerol, 5% acrylamide) (Glavac, 
D. and Dean, M., Human Mutation, 2:404-414 (1993)). 

Sequence differences between the reference gene and 
"mutants" may be revealed by the direct DNA sequencing 
method. In addition, cloned DNA segments may be used as 
probes to detect specific DNA segments. The sensitivity of 
this method is greatly enhanced when combined with PCR. For 
example, a sequencing primer is used with double-stranded PCR 
product or a single-stranded template molecule generated by 
a modified PCR. The sequence determination is performed by 
conventional procedures with radiolabeled nucleotides or by 
automatic sequencing procedures with fluorescent tags. 

Genetic testing based on DNA sequence differences may be 
achieved by detection of alteration in electrophoretic 
mobility of DNA fragments in gels with or without denaturing 
agents. Small sequence deletions and insertions can be 
visualized by high-resolution gel electrophoresis. DNA 
fragments of different sequences may be distinguished on 
denaturing formamide gradient gels in which the mobilities of 
different DNA fragments are retarded in the gel at different 
positions according to their specific melting or partial 
melting temperatures (see, e.g., Myers, et al., Science, 
230:1242 (1985)). In addition, sequence alterations, in 
particular small deletions, may be detected as changes in the 
migration pattern of DNA. 

Sequence changes at specific locations may also be 
revealed by nuclease protection assays, such as RNase and S1 
protection or the chemical cleavage method (e.g., Cotton, et 
al., PNAS, USA, 85:4397-4401 (1985)). 

Thus, the detection of the specific DNA sequence may be 
achieved by methods such as hybridization, RNase protection, 
chemical cleavage, direct DNA sequencing, or the use of 
restriction enzymes (e.g., Restriction Fragment Length 
Polymorphisms (RFLP)) and Southern blotting. 

The sequences of the present invention are also valuable 
for chromosome identification. The sequence is specifically 
targeted to and can hybridize with a particular location on 
an individual human chromosome. Moreover, there is a current 
need for identifying particular sites on the chromosome. Few 
chromosome marking reagents based on actual sequence data 
(repeat polymorphisms) are presently available for marking 
chromosomal location. The mapping of DNAs to chromosomes 
according to the present invention is an important first step 
in correlating those sequences with genes associated with 
disease. 

Briefly, sequences can be mapped to chromosomes by 
preparing PCR primers (preferably 15-25 bp) from the cDNA. 
Computer analysis of the 3' untranslated region is used to 
rapidly select primers that do not span more than one exon in 
the genomic DNA, since spanning more than one exon would 
complicate the amplification process. 
These primers are then used for PCR screening of somatic cell 
hybrids containing individual human chromosomes. Only those 
hybrids containing the human gene corresponding to the primer 
will yield an amplified fragment. 

PCR mapping of somatic cell hybrids is a rapid procedure 
for assigning a particular DNA to a particular chromosome. 
Using the present invention with the same oligonucleotide 
primers, sublocalization can be achieved with panels of 
fragments from specific chromosomes or pools of large genomic 
clones in an analogous manner. Other mapping strategies that 
can similarly be used to map to its chromosome include in 
situ hybridization, prescreening with labeled flow-sorted 
chromosomes and preselection by hybridization to construct 
chromosome-specific cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a cDNA 
clone to a metaphase chromosomal spread can be used to 
provide a precise chromosomal location in one step. This 
technique can be used with cDNA as short as 50 or 60 bases. 
For a review of this technique, see Verma et al., Human 
Chromosomes: a Manual of Basic Techniques, Pergamon Press, 
New York (1988). 

Once a sequence has been mapped to a precise chromosomal 
location, the physical position of the sequence on the 
chromosome can be correlated with genetic map data. Such 
data are found, for example, in V. McKusick, Mendelian 
Inheritance in Man (available on line through Johns Hopkins 
University Welch Medical Library). The relationship between 
genes and diseases that have been mapped to the same 
chromosomal region is then identified through linkage 
analysis (coinheritance of physically adjacent genes). 

Next, it is necessary to determine the differences in 
the cDNA or genomic sequence between affected and unaffected 
individuals. If a mutation is observed in some or all of the 
affected individuals but not in any normal individuals, then 
the mutation is likely to be the causative agent of the 
disease. 

With current resolution of physical mapping and genetic 
mapping techniques, a cDNA precisely localized to a 
chromosomal region associated with the disease could be one 
of between 50 and 500 potential causative genes. (This 
assumes 1 megabase mapping resolution and one gene per 20 
kb). 
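
The arithmetic behind this estimate can be sketched as 
follows. The function name is hypothetical. The 1 megabase 
resolution and the density of one gene per 20 kb are the 
stated assumptions; the 10 megabase region used for the upper 
figure is an inference made here for illustration and is not 
stated in the text. 

```python
def candidate_genes(region_bp: int, bp_per_gene: int = 20_000) -> int:
    """Number of genes expected in a region at a uniform gene density."""
    if region_bp < 0 or bp_per_gene <= 0:
        raise ValueError("region must be non-negative and density positive")
    return region_bp // bp_per_gene


if __name__ == "__main__":
    print(candidate_genes(1_000_000))   # 1 megabase resolution
    print(candidate_genes(10_000_000))  # coarser, assumed 10 megabase region
```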

The polypeptides, their fragments or other derivatives, 
or analogs thereof, or cells expressing them can be used as 
an immunogen to produce antibodies thereto. These antibodies 
can be, for example, polyclonal or monoclonal antibodies. 
The present invention also includes chimeric, single chain, 
and humanized antibodies, as well as Fab fragments, or the 
product of an Fab expression library. Various procedures 
known in the art may be used for the production of such 
antibodies and fragments. 

Antibodies generated against the polypeptides 
corresponding to a sequence of the present invention can be 
obtained by direct injection of the polypeptides into an 
animal or by administering the polypeptides to an animal, 
preferably a nonhuman. The antibody so obtained will then 
bind the polypeptides themselves. In this manner, even a 
sequence encoding only a fragment of the polypeptides can be 
used to generate antibodies binding the whole native 
polypeptides. Such antibodies can then be used to isolate 
the polypeptide from tissue expressing that polypeptide. 

For preparation of monoclonal antibodies, any technique 
which provides antibodies produced by continuous cell line 
cultures can be used. Examples include the hybridoma 
technique (Kohler and Milstein, 1975, Nature, 256:495-497), 
the trioma technique, the human B-cell hybridoma technique 
(Kozbor et al., 1983, Immunology Today 4:72), and the 
EBV-hybridoma technique to produce human monoclonal antibodies 
(Cole, et al., 1985, in Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc., pp. 77-96). 

Techniques described for the production of single chain 
antibodies (U.S. Patent 4,946,778) can be adapted to produce 
single chain antibodies to immunogenic polypeptide products 
of this invention. Transgenic mice may also be used to 
generate antibodies. 

The antibodies may also be employed to target colon 
cancer cells, for example, in a method of homing interaction 
agents which, when contacting colon cancer cells, destroy 
them. This is true since the antibodies are specific for the 
colon specific polypeptides of the present invention. A 
linking of the interaction agent to the antibody would cause 
the interaction agent to be carried directly to the colon. 

Antibodies of this type may also be used to do in vivo 
imaging, for example, by labeling the antibodies to 
facilitate scanning of the pelvic area and the colon. One 
method for imaging comprises contacting any cancer cells of 
the colon to be imaged with an anti-colon specific protein 
antibody labeled with a detectable marker. The method is 
performed under conditions such that the labeled antibody 
binds to the colon specific polypeptides. In a specific 
example, the antibodies interact with the colon, for example, 
colon cancer cells, and fluoresce upon contact such that 
imaging and visibility of the colon are enhanced to allow a 
determination of the diseased or non-diseased state of the 
colon. 

The present invention will be further described with 
reference to the following examples; however, it is to be 
understood that the present invention is not limited to such 
examples. All parts or amounts, unless otherwise specified, 
are by weight. 

In order to facilitate understanding of the following 
examples certain frequently occurring methods and/or terms 
will be described. 

"Plasmids" are designated by a lower case p preceded 
and/or followed by capital letters and/or numbers. The 
starting plasmids herein are either commercially available, 
publicly available on an unrestricted basis, or can be 
constructed from available plasmids in accord with published 
procedures. In addition, equivalent plasmids to those 
described are known in the art and will be apparent to the 
ordinarily skilled artisan. 

"Digestion" of DNA refers to catalytic cleavage of the 
DNA with a restriction enzyme that acts only at certain 
sequences in the DNA. The various restriction enzymes used 
herein are commercially available and their reaction 
conditions, cofactors and other requirements were used as 
would be known to the ordinarily skilled artisan. For 
analytical purposes, typically 1 µg of plasmid or DNA 
fragment is used with about 2 units of enzyme in about 20 µl 
of buffer solution. For the purpose of isolating DNA 
fragments for plasmid construction, typically 5 to 50 µg of 
DNA are digested with 20 to 250 units of enzyme in a larger 
volume. Appropriate buffers and substrate amounts for 
particular restriction enzymes are specified by the 
manufacturer. Incubation times of about 1 hour at 37°C are 
ordinarily used, but may vary in accordance with the 
supplier's instructions. After digestion the reaction is 
electrophoresed directly on a polyacrylamide gel to isolate 
the desired fragment. 

Size separation of the cleaved fragments is performed 
using 8 percent polyacrylamide gel described by Goeddel, D. 
et al., Nucleic Acids Res., 6:4057 (1980). 

"Oligonucleotides" refers to either a single stranded 
polydeoxynucleotide or two complementary polydeoxynucleotide 
strands which may be chemically synthesized. Such synthetic 
oligonucleotides have no 5' phosphate and thus will not 
ligate to another oligonucleotide without adding a phosphate 
with an ATP in the presence of a kinase. A synthetic 
oligonucleotide will ligate to a fragment that has not been 
dephosphorylated. 

"Ligation" refers to the process of forming 
phosphodiester bonds between two double stranded nucleic acid 
fragments (Maniatis, T., et al., Id., p. 146). Unless 
otherwise provided, ligation may be accomplished using known 
buffers and conditions with 10 units of T4 DNA ligase 
("ligase") per 0.5 µg of approximately equimolar amounts of 
the fragments to be ligated. 

Unless otherwise stated, transformation was performed as 
described in the method of Graham, F. and Van der Eb, A., 
Virology, 52:456-457 (1973). 

Example 1 

Determination of Transcription of the colon specific gene 

To assess the presence or absence of active 
transcription of a colon specific gene RNA, approximately 6 
ml of venous blood is obtained with a standard venipuncture 
technique using heparinized tubes. Whole blood is mixed with 
an equal volume of phosphate buffered saline, which is then 
layered over 8 ml of Ficoll (Pharmacia, Uppsala, Sweden) in 
a 15-ml polystyrene tube. The gradient is centrifuged at 
1800 X g for 20 min at 5°C. The lymphocyte and granulocyte 
layer (approximately 5 ml) is carefully aspirated and 
rediluted up to 50 ml with phosphate-buffered saline in a 
50-ml tube, which is centrifuged again at 1800 X g for 20 min. 
at 5°C. The supernatant is discarded and the pellet 
containing nucleated cells is used for RNA extraction using 
the RNazole B method as described by the manufacturer 
(Tel-Test Inc., Friendswood, TX). 

To determine the quantity of mRNA from the gene of 
interest, a probe is designed with an identity to the mRNA 
sequence transcribed from a human gene whose coding portion 
includes a DNA sequence of Figure 1. This probe is mixed 
with the extracted RNA and the mixed DNA and RNA are 
precipitated with ethanol (-70°C for 15 minutes). The pellet 
is resuspended in hybridization buffer and dissolved. The 
tubes containing the mixture are incubated in a 72°C water 
bath for 10-15 mins. to denature the DNA. The tubes are 
rapidly transferred to a water bath at the desired 
hybridization temperature. Hybridization temperature depends 
on the G + C content of the DNA. Hybridization is done for 
3 hrs. 0.3 ml of nuclease-S1 buffer is added and mixed well. 
50 µl of 4.0 M ammonium acetate and 0.1 M EDTA is added to 
stop the reaction. The mixture is extracted with 
phenol/chloroform and 20 µg of carrier tRNA is added and 
precipitation is done with an equal volume of isopropanol. 
The precipitate is dissolved in 40 µl of TE (pH 7.4) and run 
on an alkaline agarose gel. Following electrophoresis, the 
RNA is microsequenced to confirm the nucleotide sequence. 
(See Favaloro, J. et al., Methods Enzymol., 65:718 (1980) for 
a more detailed review). 
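
Since the hybridization temperature is said to depend on the 
G + C content of the DNA, the following minimal sketch shows 
how that content may be computed for a probe. The function 
name and the example sequence are hypothetical; no particular 
formula relating G + C content to hybridization temperature 
is given in the text, and none is assumed here. 

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence is empty")
    unexpected = set(seq) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    probe = "GGCCAATT"  # hypothetical probe, not a sequence of the invention
    print(f"G + C content: {gc_fraction(probe):.0%}")
```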

Two oligonucleotide primers are employed to amplify the 
sequence isolated by the above methods. The 5' primer is 20 
nucleotides long and the 3' primer is a complementary 
sequence for the 3' end of the isolated mRNA. The primers 
are custom designed according to the isolated mRNA. The 
reverse transcriptase reaction and PCR amplification are 
performed sequentially without interruption in a Perkin Elmer 
9600 PCR machine (Emeryville, CA). Four hundred ng total RNA 
in 20 µl diethylpyrocarbonate-treated water are placed in a 
65°C water bath for 5 min. and then quickly chilled on ice 
immediately prior to the addition of PCR reagents. The 50-µl 
total PCR volume consists of 2.5 units Taq polymerase 
(Perkin-Elmer); 2 units avian myeloblastosis virus reverse 
transcriptase (Boehringer Mannheim, Indianapolis, IN); 200 µM 
each of dCTP, dATP, dGTP and dTTP (Perkin Elmer); 18 pM each 
primer; 10 mM Tris-HCl; 50 mM KCl; and 2 mM MgCl2 (Perkin 
Elmer). PCR conditions are as follows: cycle 1 is 42°C for 
15 min. then 97°C for 15 s (1 cycle); cycle 2 is 95°C for 1 
min., 60°C for 1 min., and 72°C for 30 s (15 cycles); cycle 3 
is 95°C for 1 min., 60°C for 1 min., and 72°C for 1 min. (10 
cycles); cycle 4 is 95°C for 1 min., 60°C for 1 min., and 
72°C for 2 min. (8 cycles); cycle 5 is 72°C for 15 min. (1 
cycle); and the final cycle is a 4°C hold until sample is 
taken out of the machine. The 50-µl PCR products are 
concentrated down to 10 µl with vacuum centrifugation, and a 
sample is then run on a thin 1.2% Tris-borate-EDTA agarose 
gel containing ethidium bromide. A band of expected size 
would indicate that this gene is present in the tissue 
assayed. The amount of RNA in the pellet may be quantified 
in numerous ways, for example, it may be weighed. 
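
The cycling conditions above can be written down as a simple 
data structure, which makes the number of amplification cycles 
and the programmed run time easy to check. This is only a 
sketch: the class and stage names are hypothetical, the 
annealing temperature is taken as 60°C throughout (an 
assumption wherever the source text is unclear), and ramp 
times and the final 4°C hold are not counted. 

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    name: str        # hypothetical label, for readability only
    steps: tuple     # tuple of (temperature in deg C, duration in seconds)
    repeats: int


PROGRAM = (
    Stage("reverse transcription", ((42, 15 * 60), (97, 15)), 1),
    Stage("amplification, 30 s extension", ((95, 60), (60, 60), (72, 30)), 15),
    Stage("amplification, 1 min extension", ((95, 60), (60, 60), (72, 60)), 10),
    Stage("amplification, 2 min extension", ((95, 60), (60, 60), (72, 120)), 8),
    Stage("final extension", ((72, 15 * 60),), 1),
)


def stage_seconds(stage: Stage) -> int:
    """Programmed time for one stage, excluding temperature ramps."""
    return stage.repeats * sum(seconds for _, seconds in stage.steps)


def total_seconds(program) -> int:
    """Programmed time for the whole run, excluding ramps and the hold."""
    return sum(stage_seconds(stage) for stage in program)


if __name__ == "__main__":
    cycles = sum(stage.repeats for stage in PROGRAM[1:4])
    print(f"amplification cycles: {cycles}")
    print(f"programmed time: {total_seconds(PROGRAM) / 60:.2f} min")
```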

Verification of the nucleotide sequence of the PCR 
products is done by microsequencing. The PCR product is 
purified with a Qiagen PCR Product Purification Kit (Qiagen, 
Chatsworth, CA) as described by the manufacturer. One µg of 
the PCR product undergoes PCR sequencing by using the Taq 
DyeDeoxy Terminator Cycle sequencing kit in a Perkin-Elmer 
9600 PCR machine as described by Applied Biosystems (Foster 
City, CA). The sequenced product is purified using Centri-Sep 
columns (Princeton Separations, Adelphia, NJ) as described by 
the company. This product is then analyzed with an ABI model 
373A DNA sequencing system (Applied Biosystems) integrated 
with a Macintosh IIci computer. 

Example 2 

Bacterial Expression and Purification of the colon specific 
gene protein and Use For Preparing a Monoclonal Antibody 

The DNA sequence encoding a polypeptide of the present 
invention, ATCC # 97129, is initially amplified using PCR 
oligonucleotide primers corresponding to the 5' sequences of 
the processed protein (minus the signal peptide sequence) and 
the vector sequences 3' to the gene. Additional nucleotides 
corresponding to the DNA sequence are added to the 5' and 3' 
sequences respectively. The 5' oligonucleotide primer 
GCA6(£R^TCCrG6CTTCCAGAAC3CAT6 (BamHI) (SEQ ID NO:3) may contain, 
for example, a restriction enzyme site followed by 
nucleotides of coding sequence starting from the presumed 
terminal amino acid of the processed protein. The 3' 
sequence TACGGGTACCTTGCTCTATGGTOSCSTAC (Asp718) (SEQ ID NO:4) 
may, for example, contain complementary sequences to a 
restriction enzyme site and also be followed by nucleotides 
of the nucleic acid sequence encoding the protein of 
interest. The restriction enzyme sites correspond to the 
restriction enzyme sites on a bacterial expression vector, 
for example, pQE-32 (Qiagen, Inc., Chatsworth, CA). pQE-32 
encodes antibiotic resistance (Ampr), a bacterial origin of 
replication (ori), an IPTG-regulatable promoter operator 
(P/O), a ribosome binding site (RBS), a 6-His tag and 
restriction enzyme sites. pQE-9 is then digested with the 
restriction enzymes corresponding to restriction enzyme sites 
contained in the primer sequences. The amplified sequences 
are ligated into pQE-9 and inserted in frame with the 
sequence encoding for the histidine tag and the RBS. The 
ligation mixture is then used to transform an E. coli strain, 
for example, M15/rep4 (Qiagen) by the procedure described in 
Sambrook, J. et al., Molecular Cloning: A Laboratory Manual, 
Cold Spring Harbor Laboratory Press, (1989). M15/rep4 contains 
multiple copies of the plasmid pREP4, which expresses the 
lacI repressor and also confers kanamycin resistance (Kanr). 
Transformants are identified by their ability to grow on LB 
plates and ampicillin/kanamycin resistant colonies are 
selected. Plasmid DNA is isolated and confirmed by 
restriction analysis. Clones containing the desired 
constructs are grown overnight (O/N) in liquid culture in LB 
media supplemented with both Amp (100 µg/ml) and Kan (25 
µg/ml). The O/N culture is used to inoculate a large culture 
at a ratio of 1:100 to 1:250. The cells are grown to an 
optical density 600 (O.D.600) of between 0.4 and 0.6. IPTG 
("Isopropyl-β-D-thiogalactopyranoside") is then added to a 
final concentration of 1 mM. IPTG induces by inactivating 
the lacI repressor, clearing the P/O leading to increased 
gene expression. Cells are grown an extra 3 to 4 hours. 
Cells are then harvested by centrifugation. The cell pellet 
is solubilized in the chaotropic agent 6 Molar Guanidine HCl. 
After clarification, solubilized protein is purified from 
this solution by chromatography on a Nickel-Chelate column 
under conditions that allow for tight binding by proteins 
containing the 6-His tag (Hochuli, E. et al., J. 
Chromatography 411:177-184 (1984)). The protein is eluted 
from the column in 6 molar guanidine HCl pH 5.0 and for the 
purpose of renaturation adjusted to 3 molar guanidine HCl, 
100 mM sodium phosphate, 10 mmolar glutathione (reduced) and 
2 mmolar glutathione (oxidized). After incubation in this 
solution for 12 hours the protein is dialyzed to 10 mmolar 
sodium phosphate. 
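
The inoculation ratio and the IPTG addition described above 
reduce to simple dilution arithmetic, sketched below. The 
function names, the 1000 ml culture volume and the 1 M IPTG 
stock concentration are hypothetical values chosen for 
illustration; only the 1:100 to 1:250 ratio and the 1 mM final 
concentration come from the text. The small volume added by 
the stock itself is ignored. 

```python
def inoculum_volume_ml(culture_volume_ml: float, dilution_ratio: float) -> float:
    """Overnight culture needed to inoculate at 1:dilution_ratio."""
    if dilution_ratio <= 0:
        raise ValueError("dilution ratio must be positive")
    return culture_volume_ml / dilution_ratio


def stock_volume_ml(final_volume_ml: float, final_mm: float, stock_mm: float) -> float:
    """Stock volume from C1 * V1 = C2 * V2, concentrations in mM."""
    if stock_mm <= 0:
        raise ValueError("stock concentration must be positive")
    return final_volume_ml * final_mm / stock_mm


if __name__ == "__main__":
    culture_ml = 1000.0  # hypothetical large culture volume
    print(inoculum_volume_ml(culture_ml, 100))   # inoculum at 1:100
    print(inoculum_volume_ml(culture_ml, 250))   # inoculum at 1:250
    # hypothetical 1 M (1000 mM) IPTG stock brought to 1 mM final
    print(stock_volume_ml(culture_ml, 1.0, 1000.0))
```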

The protein purified in this manner may be used as an 
epitope to raise monoclonal antibodies specific to such 
protein. The monoclonal antibodies generated against the 
isolated protein can be obtained by direct 
injection of the polypeptides into an animal or by 
administering the polypeptides to an animal. The antibodies 
so obtained will then bind to the protein itself. Such 
antibodies can then be used to isolate the protein from 
tissue expressing that polypeptide by the use of, for 
example, an ELISA assay. 

Example 3 
Expression via Gene Therapy 

Fibroblasts are obtained from a subject by skin biopsy. 
The resulting tissue is placed in tissue-culture medium and 
separated into small pieces. Small chunks of the tissue are 
placed on a wet surface of a tissue culture flask; 
approximately ten pieces are placed in each flask. The flask 
is turned upside down, closed tight and left at room 
temperature over night. After 24 hours at room temperature, 
the flask is inverted and the chunks of tissue remain fixed 
to the bottom of the flask and fresh media (e.g., Ham's F12 
media, with 10% FBS, penicillin and streptomycin) is added. 
This is then incubated at 37°C for approximately one week. 
At this time, fresh media is added and subsequently changed 
every several days. After an additional two weeks in 
culture, a monolayer of fibroblasts emerges. The monolayer is 
trypsinized and scaled into larger flasks. 

pMV-7 (Kirschmeier, P.T. et al., DNA, 7:219-25 (1988)), 
flanked by the long terminal repeats of the Moloney murine 
sarcoma virus, is digested with EcoRI and HindIII and 
subsequently treated with calf intestinal phosphatase. The 
linear vector is fractionated on agarose gel and purified, 
using glass beads. 

The cDNA encoding a polypeptide of the present invention 
is amplified using PCR primers which correspond to the 5' and 
3' end sequences respectively. The 5' primer contains an 
EcoRI site and the 3' primer contains a HindIII site. Equal 
quantities of the Moloney murine sarcoma virus linear 
backbone and the EcoRI and HindIII fragment are added 
together, in the presence of T4 DNA ligase. The resulting 
mixture is maintained under conditions appropriate for 
ligation of the two fragments. The ligation mixture is used 
to transform bacteria HB101, which are then plated onto agar 
containing kanamycin for the purpose of confirming that the 
vector had the gene of interest properly inserted. 

The amphotropic PA317 or GP+am12 packaging cells are 
grown in tissue culture to confluent density in Dulbecco's 
Modified Eagles Medium (DMEM) with 10% calf serum (CS), 
penicillin and streptomycin. The MSV vector containing the 
gene is then added to the media and the packaging cells are 
transduced with the vector. The packaging cells now produce 
infectious viral particles containing the gene (the packaging 
cells are now referred to as producer cells). 

Fresh media is added to the transduced producer cells, 
and subsequently, the media is harvested from a 10 cm plate 
of confluent producer cells. The spent media, containing the 
infectious viral particles, is filtered through a Millipore 
filter to remove detached producer cells and this media is 
then used to infect fibroblast cells. Media is removed from 
a sub-confluent plate of fibroblasts and quickly replaced 
with the media from the producer cells. This media is 
removed and replaced with fresh media. If the titer of virus 
is high, then virtually all fibroblasts will be infected and 
no selection is required. If the titer is very low, then it 
is necessary to use a retroviral vector that has a selectable 
marker, such as neo or his. 

The engineered fibroblasts are then injected into the 
host, either alone or after having been grown to confluence 
on cytodex 3 microcarrier beads. The fibroblasts now produce 
the protein product. 

Example 4 

Cloning and expression of colon specific gene polypeptide 
using the baculovirus expression system 

The DNA sequence encoding the full length protein, ATCC 
# 97129, was amplified using PCR oligonucleotide primers 
corresponding to the 5' and 3' sequences of the gene: 

The 5' primer has the sequence 5' ATCGGGATCC6CCATCMS 
GCTTCCAGAAGCATGCB (SEQ ID NO: 5) and contains a BamHI 
restriction enzyme site (in bold) followed by 6 nucleotides 
resembling an efficient signal for the initiation of 
translation in eukaryotic cells (Kozak, M., J. Mol. Biol., 
196:947-950 (1987)) which was just behind the first 20 
nucleotides of the colon specific gene (the initiation codon 
for translation was underlined). 

The 3' primer has the sequence 5' TACG6GTACCTTGCTC 
TATG(3TC3GGTAC 3' (SEQ ID NO: 6) and contains the cleavage site 
for the restriction endonuclease Asp718 and 5 nucleotides 
complementary to the 3' non-translated sequence of the colon 
specific gene. The amplified sequences were isolated from a 
1% agarose gel using a commercially available kit 
("Geneclean," BIO 101 Inc., La Jolla, Ca.). The fragment was 
then digested with the endonucleases BamHI and Asp718 and 
then purified again on a 1% agarose gel. This fragment was 
designated F2. 
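
Whether a primer carries the intended cloning sites can be 
checked by searching for the enzyme recognition sequences, as 
in the following minimal sketch. The recognition sequences 
GGATCC (BamHI) and GGTACC (Asp718) are well known; the 
function name and the two example primers are hypothetical 
and are not the primers of SEQ ID NO: 5 or SEQ ID NO: 6. Both 
sites are palindromic, so only one strand needs to be 
searched. 

```python
RECOGNITION_SITES = {
    "BamHI": "GGATCC",
    "Asp718": "GGTACC",
}


def find_sites(sequence: str) -> dict:
    """Return 0-based start positions of each recognition site."""
    seq = sequence.upper()
    found = {}
    for enzyme, site in RECOGNITION_SITES.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start)
            start = seq.find(site, start + 1)
        found[enzyme] = positions
    return found


if __name__ == "__main__":
    # Hypothetical primers used only to exercise the search.
    forward_primer = "ATCGGGATCCGCCATCATGGCT"
    reverse_primer = "TACGGGTACCTTGC"
    print(find_sites(forward_primer))
    print(find_sites(reverse_primer))
```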

The vector pA2 (modification of pVL941 vector, discussed 
below) was used for the expression of the colon specific 
protein using the baculovirus expression system (for review 
see: Summers, M.D. and Smith, G.E. 1987, A manual of methods 
for baculovirus vectors and insect cell culture procedures, 
Texas Agricultural Experimental Station Bulletin NO: 1555). 
This expression vector contains the strong polyhedrin 
promoter of the Autographa californica nuclear polyhedrosis 
virus (AcMNPV) followed by the recognition sites for the 
restriction endonucleases BamHI and Asp718. The 
polyadenylation site of the simian virus (SV)40 was used for 
efficient polyadenylation. For an easy selection of 
recombinant viruses the beta-galactosidase gene from E. coli 
was inserted in the same orientation as the polyhedrin 
promoter followed by the polyadenylation signal of the 
polyhedrin gene. The polyhedrin sequences were flanked at 
both sides by viral sequences for the cell-mediated 
homologous recombination of cotransfected wild-type viral 
DNA. Many other baculovirus vectors could be used in place 
of pRG1 such as pAc373, pVL941 and pAcIM1 (Luckow, V.A. and 
Summers, M.D., Virology, 170:31-39). 

The plasmid was digested with the restriction enzymes 
and then dephosphorylated using calf intestinal phosphatase 
by procedures known in the art. The DNA was then isolated 
from a 1% agarose gel using the commercially available kit 
("Geneclean," BIO 101 Inc., La Jolla, Ca.). This vector DNA 
was designated V2. 

Fragment F2 and the dephosphorylated plasmid V2 were 
ligated with T4 DNA ligase. DH5α cells were then transformed 
and bacteria identified that contained the plasmid 
(pBac-colon specific polypeptide) with the colon specific gene 
using the enzymes BamHI and Asp718. The sequence of the 
cloned fragment was confirmed by DNA sequencing. 

5 µg of the plasmid pBac-colon specific gene was 
cotransfected with 1.0 µg of a commercially available 
linearized baculovirus ("BaculoGold™ baculovirus DNA", 
Pharmingen, San Diego, CA.) using the lipofection method 
(Felgner et al., Proc. Natl. Acad. Sci. USA, 84:7413-7417 
(1987)). 

1 µg of BaculoGold™ virus DNA and 5 µg of the plasmid 
pBac-colon specific gene were mixed in a sterile well of a 
microtiter plate containing 50 µl of serum free Grace's 
medium (Life Technologies Inc., Gaithersburg, MD). 
Afterwards 10 µl Lipofectin plus 90 µl Grace's medium were 
added, mixed and incubated for 15 minutes at room 
temperature. Then the transfection mixture was added 
dropwise to the Sf9 insect cells (ATCC CRL 1711) seeded in a 
35 mm tissue culture plate with 1 ml Grace's medium without 
serum. The plate was rocked back and forth to mix the newly 
added solution. The plate was then incubated for 5 hours at 
27°C. After 5 hours the transfection solution was removed 
from the plate and 1 ml of Grace's insect medium supplemented 
with 10% fetal calf serum was added. The plate was put back 
into an incubator and cultivation continued at 27°C for four 
days. 

After four days the supernatant was collected and a 
plaque assay performed similarly to that described by Summers 
and Smith (supra). As a modification an agarose gel with "Blue 
Gal" (Life Technologies Inc., Gaithersburg) was used which 
allows an easy isolation of blue stained plaques. (A 
detailed description of a "plaque assay" can also be found in 
the user's guide for insect cell culture and baculovirology 
distributed by Life Technologies Inc., Gaithersburg, page 
9-10). 

Four days after the serial dilution, the viruses were 
added to the cells and blue stained plaques were picked with 
the tip of an Eppendorf pipette. The agar containing the 
recombinant viruses was then resuspended in an Eppendorf tube 
containing 200 µl of Grace's medium. The agar was removed by 
a brief centrifugation and the supernatant containing the 
recombinant baculovirus was used to infect Sf9 cells seeded 
in 35 mm dishes. Four days later the supernatants of these 
culture dishes were harvested and then stored at 4°C. 

Sf9 cells were grown in Grace's medium supplemented with 
10% heat-inactivated FBS. The cells were infected with the 
recombinant baculovirus V-colon specific gene at a 
multiplicity of infection (MOI) of 2. Six hours later the 
medium was removed and replaced with SF900 II medium minus 
methionine and cysteine (Life Technologies Inc., 
Gaithersburg). 42 hours later 5 µCi of 35S-methionine and 5 
µCi 35S cysteine (Amersham) were added. The colon specific 
protein was purified from infected cells 72 hours post 
infection by cell lysis in hypotonic phosphate buffer and 
further purified by ion exchange chromatography, size 
exclusion chromatography, and reverse phase chromatography. 
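
The volume of virus stock needed to reach a given multiplicity 
of infection follows from the number of cells and the stock 
titer, as sketched below. The function name, the cell count 
and the titer are hypothetical values for illustration; only 
the MOI of 2 comes from the text. 

```python
def virus_volume_ml(cell_count: float, moi: float, titer_pfu_per_ml: float) -> float:
    """Volume of virus stock giving `moi` infectious units per cell."""
    if titer_pfu_per_ml <= 0:
        raise ValueError("titer must be positive")
    if cell_count < 0 or moi < 0:
        raise ValueError("cell count and MOI must be non-negative")
    return cell_count * moi / titer_pfu_per_ml


if __name__ == "__main__":
    cells = 1_000_000         # hypothetical number of Sf9 cells in the dish
    titer = 100_000_000       # hypothetical stock titer in pfu per ml
    print(virus_volume_ml(cells, moi=2, titer_pfu_per_ml=titer), "ml")
```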

Numerous modifications and variations of the present 
invention are possible in light of the above teachings and, 
therefore, within the scope of the appended claims, the 
invention may be practiced otherwise than as particularly 
described. 
SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: LI, ET AL. 

(ii) TITLE OF INVENTION: Human Colon Specific Gene 

(iii) NUMBER OF SEQUENCES: 6 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 

CECCHI, STEWART & OLSTEIN 

(B) STREET: 6 BECKER FARM ROAD 

(C) CITY: ROSELAND 

(D) STATE: NEW JERSEY 

(E) COUNTRY: USA 

(F) ZIP: 07068 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 INCH DISKETTE 

(B) COMPUTER: IBM PS/2 

(C) OPERATING SYSTEM: MS-DOS 

(D) SOFTWARE: WORD PERFECT 5.1 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: Concurrently 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: FERRARO, GREGORY D. 

(B) REGISTRATION NUMBER: 36,134 

(C) REFERENCE/DOCKET NUMBER: 325800- 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 201-994-1700 

(B) TELEFAX: 201-994-1744 

(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 1114 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

GCACGAGGCC AAACAGATTT GCAGATCAAG GAGAACCCAG GACTTTCAAA GAAGOGCTAG 60 

TAAGGTCrCT GAGATCCTTG CACTAGCTAC ATCCTCAGGG TAGGAGGAAG ATGGCTTCCA 120 

GAAGCATG0G GCTGCTCCTA TTGCTGAGCT GCCTGGCCAA AAC3U3GAGTC CTGGGIGAXA 180 

TCATCATGAG ACCGAGCTGT GCTCCTGGAT GGTTTTACCA CAAGTCCAAT TGCTATGGTT 240 

ACTTCAGGAA GCTGAGGAAC TGGTCTGATG CCGAGCT0GA GTGTCAGTCT TAOGGAAAOG 300 

GAGCCCACCT GGCSITCTATC CTGAGTTTAA AGGAAGCC31G CACCATAGCA GAGTACATAA 360 

GTGGCTATCA GAGAAGCCAG CXX3ATATGGA TTGGCCTGC3V CGACCCACAG AAGAGGCAGC 420 

AGTGGCAGTG GATTGATGGG GCCATGTATC TGTACAGATC CTGGTCTGGC AAGTCCATGG 480 

GTGGGAACAA GCACTGTGCT GAGATGAGCT CCAAXAACAA CTTTTTAACT TGGAGCAGCA 540 

AOGAATGCAA CAAGCGCCAA CACTTCCTGT GCAAGTACCG ACCATAGAGC AAGAATCAAG 600 

ATTCTGCTRA CTCCIGCACA GCCCOCSTCCT CTTCCTTTCT GCIAGCCIGG CXAAATCTGC 660 

TCATTATTTC AGAGGGGAAA CCTAGCSUUIC TAAGAGTGAT AAGGGCCCXA CXACACTGGC 720 

TTTTTTAGGC TTAGAGACAG AAACTTTAGC ATTGGCCC3W3 TAGTGGCTTC TAGCTCEAAA 780 

TGTTTGCCCC GCCATCCCTT TCCACAGTAT CCTTCTTCCC TCCTCCCCTG TCTCIGGCTG 840 

TCTCGAGCAS TCTAGAAGAG TGCATCTCCA GCCCATGAAA CAGCIGGGTC TTTGGCCATA 900 

AGAAGTAAAG ATTTGAAGAC AGAAGGAAGA AACTCAGGAG TAAGCTXCCA GACCCCTTCA 960 

GCTTCTACAC CCTTCTGCCC TCTCTCCATr GCCTGCACCC CACCCCAGCC ACTCAACTCC 1020 



TGCITGTTTT TCCTTTGGCC ATAGGAAGGT TTUCCAGTAG AATCCTTGCT AGGTTGATGT 1080 
GGGCCATACA TTCCTTTAAT AAACCATTGT GTAC 1114 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 158 AMINO ACIDS 

(B) TYPE: AMINO ACID 

(C) STRANDEDNESS: 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PROTEIN 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Ser Arg Ser Met Arg Leu Leu Leu Leu Leu Ser Cys Leu 

5 10 15 

Ala Lys Thr Gly Val Leu Gly Asp Ile Ile Met Arg Pro Ser Cys 

20 25 30 

Ala Pro Gly Trp Phe Tyr His Lys Ser Asn Cys Tyr Gly Tyr Phe 

35 40 45 

Arg Lys Leu Arg Asn Trp Ser Asp Ala Glu Leu Glu Cys Gln Ser 

50 55 60 

Tyr Gly Asn Gly Ala His Leu Ala Ser Ile Leu Ser Leu Lys Glu 

65 70 75 

Ala Ser Thr Ile Ala Glu Tyr Ile Ser Gly Tyr Gln Arg Ser Gln 

80 85 90 

Pro Ile Trp Ile Gly Leu His Asp Pro Gln Lys Arg Gln Gln Trp 

95 100 105 

Gln Trp Ile Asp Gly Ala Met Tyr Leu Tyr Arg Ser Trp Ser Gly 
110 115 120 

Lys Ser Met Gly Gly Asn Lys His Cys Ala Glu Met Ser Ser Asn 
125 130 135 

Asn Asn Phe Leu Thr Trp Ser Ser Asn Glu Cys Asn Lys Arg Gln 
140 145 150 

His Phe Leu Cys Lys Tyr Arg Pro 
155 
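
The listing above can be checked against its stated length of 158 amino acids. The following is a minimal sketch, not part of the original filing: it transcribes the residues of SEQ ID NO: 2 using the standard three-letter codes (Ile, Gln, and so on), converts them to one-letter codes, and counts them. The function and variable names are illustrative assumptions.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Residues of SEQ ID NO: 2 as listed above, 15 per row.
SEQ_ID_NO_2 = """
Met Ala Ser Arg Ser Met Arg Leu Leu Leu Leu Leu Ser Cys Leu
Ala Lys Thr Gly Val Leu Gly Asp Ile Ile Met Arg Pro Ser Cys
Ala Pro Gly Trp Phe Tyr His Lys Ser Asn Cys Tyr Gly Tyr Phe
Arg Lys Leu Arg Asn Trp Ser Asp Ala Glu Leu Glu Cys Gln Ser
Tyr Gly Asn Gly Ala His Leu Ala Ser Ile Leu Ser Leu Lys Glu
Ala Ser Thr Ile Ala Glu Tyr Ile Ser Gly Tyr Gln Arg Ser Gln
Pro Ile Trp Ile Gly Leu His Asp Pro Gln Lys Arg Gln Gln Trp
Gln Trp Ile Asp Gly Ala Met Tyr Leu Tyr Arg Ser Trp Ser Gly
Lys Ser Met Gly Gly Asn Lys His Cys Ala Glu Met Ser Ser Asn
Asn Asn Phe Leu Thr Trp Ser Ser Asn Glu Cys Asn Lys Arg Gln
His Phe Leu Cys Lys Tyr Arg Pro
"""


def to_one_letter(three_letter_sequence: str) -> str:
    """Convert whitespace-separated three-letter residues to one-letter codes.

    Raises KeyError if a token is not a standard three-letter code.
    """
    return "".join(THREE_TO_ONE[res] for res in three_letter_sequence.split())


protein = to_one_letter(SEQ_ID_NO_2)
print(len(protein))   # number of residues in the listing
print(protein[:15])   # first row of the listing in one-letter form
```

A lookup that raises on unknown tokens is used deliberately, so that a scanning error in a residue name surfaces as an error rather than being silently skipped.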
WHAT IS CLAIMED IS: 

1. An isolated polynucleotide comprising a member 
selected from the group consisting of: 

(a) a polynucleotide encoding the polypeptide 
comprising amino acid 1 to amino acid 158 as set forth in 
SEQ ID NO:2; 

(b) a polynucleotide which encodes a mature 
polypeptide encoded by the DNA contained in ATCC Deposit 
No. 97129; 

(c) a polynucleotide capable of hybridizing to 
and which is at least 70% identical to the polynucleotide 
of (a) or (b); and 

(d) a polynucleotide fragment of the 
polynucleotide of (a), (b) or (c). 
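
For illustration only, a percent-identity figure such as the 70% threshold recited above can be computed from two sequences that have already been aligned. The claims do not specify how identity is to be calculated, so the convention below (identical positions divided by all aligned columns, with gap columns counted as mismatches) is an assumption, and the two short sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Gaps are written as '-'. Columns where both sequences have a gap are
    ignored; any other column counts toward the denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" and y == "-":
            continue  # column carries no information
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


# Hypothetical 10-base sequences differing at two positions.
identity = percent_identity("ATGGCTTCCA", "ATGGCATCCT")
print(f"{identity:.1f}% identical; meets 70% threshold: {identity >= 70.0}")
```

Other conventions (for example, excluding gap columns from the denominator) give different values for the same alignment, so the convention used should be stated alongside any reported figure.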

2. The polynucleotide of Claim 1 wherein the 
polynucleotide is DNA. 

3. The polynucleotide of Claim 2 which encodes the 
polypeptide comprising amino acid 1 to 158 of SEQ ID NO:2. 

4. An isolated polynucleotide having at least 10 
base pairs which hybridizes to and which has at least 70% 
identity to a member selected from the group consisting of: 

(a) RNA transcribed from a human gene, said gene 
including DNA having at least a 90% identity to the DNA 
selected from the group consisting of SEQ ID NO:1; and 

(b) DNA corresponding to the RNA of (a). 

5. A vector containing the DNA of Claim 2. 

6. A host cell genetically engineered with the 
vector of Claim 5. 
7. A process for producing a polypeptide comprising: 
expressing from the host cell of Claim 6 the polypeptide 
encoded by said DNA. 

8. A process for producing cells capable of 
expressing a polypeptide comprising genetically engineering 
cells with the vector of Claim 5. 

9. A polypeptide encoded by the polynucleotide of 
claim 1 and comprising a member selected from the group 
consisting of (i) a polypeptide having the deduced amino 
acid sequence of SEQ ID NO: 2 and fragments, analogs and 
derivatives thereof; and (ii) a polypeptide encoded by the 
cDNA of ATCC Deposit No. 97129 and fragments, analogs and 
derivatives of said polypeptide. 

10. An agonist for the polypeptide of claim 9. 

11. An antagonist against the polypeptide of claim 9. 

12. A method for the treatment of a patient having 
need to inhibit a colon specific gene protein comprising: 
administering to the patient a therapeutically effective 
amount of the antagonist of Claim 11. 

13. The method of claim 12 wherein the antagonist is 
a polypeptide and the therapeutically effective amount of 
the antagonist is administered by providing to the patient 
DNA encoding said polypeptide and expressing said 
polypeptide in vivo. 

14. A method for the treatment of a patient having 
need of a colon specific gene protein comprising: 
administering to the patient a therapeutically effective 
amount of the polypeptide of claim 9. 
15. The method of claim 14 wherein the 
therapeutically effective amount of the polypeptide is 
administered by providing to the patient DNA encoding said 
polypeptide and expressing said polypeptide in vivo. 

16. A process of screening compounds to identify 
antagonists to the polypeptide of claim 9, said process 
comprising: 

combining the polypeptide with elements which 
undergo simultaneous oxidation and reduction in the 
presence of the compound to be screened under conditions 
favorable to an oxidation reduction reaction; and 

determining the ability of the compound to 
inhibit the reaction. 

17. A process for diagnosing a disorder of the colon 
in a host comprising: 

determining transcription of a human gene in a 
sample derived from non-colon tissue of a host, said gene 
having a coding portion which includes DNA having at least 
90% identity to DNA selected from the group consisting of 
the DNA of SEQ ID NO:1, whereby said transcription 
indicates a disorder of the colon in the host. 

18. The process of claim 17 wherein transcription is 
determined by detecting the presence of an altered level of 
RNA transcribed from said human gene. 

19. The process of claim 17 wherein transcription is 
determined by detecting the presence of an altered level of 
DNA complementary to the RNA transcribed from said human 
gene. 

20. The process of claim 17 wherein transcription is 
determined by detecting the presence of an altered level of 
an expression product of said human gene. 